点卷积神经网络：3D点云语义分割与对象识别

需积分: 50 118 浏览量更新于2024-09-07 收藏 7.34MB PDF 举报

"pointwise卷积神经网络用于3D点云的深度学习，旨在实现语义分割和对象识别" 本文介绍了一种针对3D点云数据的新型卷积神经网络（CNN），称为“pointwise卷积神经网络”，它专门设计用于处理3D点云的语义分割和对象识别任务。点云数据，即由三维空间中的离散点表示的几何形状，近年来在深度学习领域引起了广泛的研究兴趣。然而，将卷积神经网络充分应用到点云处理上仍然存在挑战。点wise卷积是该网络的核心，这是一种新的卷积运算符，能够逐点对点云进行操作。这一创新使得网络能够直接处理不规则且无序的点云数据，而传统CNN通常适用于结构化的网格数据，如图像。尽管网络的设计简单易实现，但在语义分割和对象识别任务上却能展现出与现有方法竞争的准确性。深度学习在3D数据上的应用已经推动了场景理解、形状补全和形状匹配等领域的显著进步。其中，场景理解被认为是最重要的任务之一，因为它涉及识别和解释环境中的各个物体和其相互关系。点wise卷积神经网络的出现，为3D数据的深度学习提供了新的可能性。点wise卷积的工作原理是通过在每个点上应用滤波器，以提取局部特征。这种操作可以保留点云的原始拓扑信息，并且能够在不增加复杂性的前提下，有效地扩展特征表达能力。在语义分割中，网络可以学习将点云中的每个点分配到预定义的类别中，而在对象识别任务中，网络则负责识别出点云表示的具体物体。为了训练这样的网络，通常需要大量的标注数据，包括3D点云及其对应的语义标签或对象类别。数据集的构建和标注是一项耗时的任务，但随着3D扫描技术和自动标注技术的发展，这个问题正在逐渐得到解决。此外，优化网络的参数设置和结构以适应特定任务，也是研究中的关键部分。点wise卷积神经网络的另一个优势在于其可扩展性。它可以与其他深度学习架构，如递归神经网络（RNN）或Transformer结合，以处理更复杂的3D数据序列。此外，这种网络还可以应用于自动驾驶、机器人导航、虚拟现实和增强现实等领域，因为这些领域都需要对环境的精确理解和解析。 pointwise卷积神经网络为3D点云的深度学习提供了一种新的有效工具，它通过逐点卷积解决了处理非结构化数据的难题，并在语义分割和对象识别任务中展现了强大的性能。随着点云数据处理技术的不断进步，我们可以期待在未来看到更多基于pointwise卷积的创新应用。

Pointwise Convolutional Neural Networks

Binh-Son Hua Minh-Khoi Tran Sai-Kit Yeung

The University of Tokyo Singapore University of Technology and Design

Abstract

Deep learning with 3D data such as reconstructed point

clouds and CAD models has received great research inter-

ests recently. However, the capability of using point clouds

with convolutional neural network has been so far not fully

explored. In this paper, we present a convolutional neural

network for semantic segmentation and object recognition

with 3D point clouds. At the core of our network is point-

wise convolution, a new convolution operator that can be

applied at each point of a point cloud. Our fully convolu-

tional network design, while being surprisingly simple to

implement, can yield competitive accuracy in both semantic

segmentation and object recognition task.

1. Introduction

Deep learning with 3D data has received great research

interests recently, which leads to noticeable advances in

typical applications including scene understanding, shape

completion, and shape matching. Among these, scene un-

derstanding is considered as one of the most important tasks

for robots and drones as it can assist exploratory scene nav-

igations. Tasks such as semantic scene segmentation and

object recognition are often performed to predict contex-

tual information about objects for both indoor and outdoor

scenes.

Unfortunately, deep learning in 3D was deemed difﬁcult

due to the fact that there are several ways to represent 3D data

such as volumes, point clouds, or multi-view images. Vol-

ume representation is a true 3D representation and straight-

forward to implement but often requires a large amount of

memory for data storage. By contrast, multi-view represen-

tation is not a true 3D representation but shows promising

prediction accuracy as existing pre-trained weights from 2D

networks can be utilized. Among such representations, point

clouds have been the most ﬂexible as they are compact and

This work was done when Binh-Son Hua was a postdoctoral researcher

in Singapore University of Technology and Design in 2017.

Figure 1: Pointwise convolution. We deﬁne a new convo-

lution operator for point cloud input. For each point, near-

est neighbors are queried on the ﬂy and binned into kernel

cells before convolving with kernel weights. By stacking

pointwise convolution operators together, we can build fully

convolutional neural networks for scene segmentation and

object recognition for point clouds.

could be exported from a wide range of CAD modelling

and 3D reconstruction software. However, the capability of

using point clouds with neural network has been so far not

fully explored.

In this paper, we present a convolutional neural network

for semantic segmentation and object recognition with 3D

point clouds. At the core of our network is a new convolution

operator, called pointwise convolution, which can be applied

at each point in a point cloud to learn pointwise features.

This leads to surprisingly simple and fully convolutional net-

work designs for scene segmentation and object recognition.

Our experiments show that pointwise convolution can yield

competitive accuracy to previous techniques while being

much simpler to implement. In summary, our contributions

are:

•

A pointwise convolution operator that can output fea-

tures at each point in a point cloud;

•

Two pointwise convolutional neural networks for se-

mantic scene segmentation and object recognition.

arXiv:1712.05245v2 [cs.CV] 29 Mar 2018

下载后可阅读完整内容，剩余9页未读，立即下载

you62580

粉丝: 0
资源: 1

点卷积神经网络：3D点云语义分割与对象识别

37. Depthwise卷积与Pointwise卷积 - 干巴他爹的小本本 - CSDN博客1

卷积神经网络

基于分组模块的卷积神经网络设计.pdf

深度学习中pointwise卷积的作用

point-wise卷积

depth-wise卷积

pointwise convolution和1x1卷积

depthwise convolution and pointwise convolution

pointwise convolution 参数量

深度可分离卷积神经网络

最新资源