点云分类的图卷积神经网络研究

版权申诉

16 浏览量更新于2024-08-16 收藏 538KB PDF 举报

"AGRAPH-CNN FOR 3D POINT CLOUD CLASSIFICATION" 在当前的计算机视觉领域，3D点云数据的处理与分析已经成为一个重要的研究方向。传统的卷积神经网络（CNNs）主要针对结构化的网格数据，如图像，但面对非结构化的3D点云数据，它们的能力有限。这就是图卷积神经网络（Graph Convolutional Neural Networks, Graph-CNNs）发挥作用的地方。Graph-CNNs能够处理以图形式存在的数据，这恰好适合没有固定顺序且拓扑结构不规则的3D点云。点云数据是从三维空间中物体表面采样得到的离散点集，它们通常没有自然的排列顺序，且每个点的邻接关系（邻居数量）可能不同。这种复杂性使得传统CNN难以直接应用。论文"AGRAPH-CNN FOR 3D POINT CLOUD CLASSIFICATION"提出了一个名为PointGCN的新型Graph-CNN架构，专门用于3D点云数据的分类。 PointGCN的核心在于结合了局部图卷积和两种类型的图下采样操作（也称为池化）。局部图卷积允许模型捕捉点云中的局部结构信息，而图下采样则有助于减少计算量并保持关键特征，这在处理大规模点云时尤其重要。这种设计使得PointGCN能够在不丢失重要信息的情况下，有效地对点云进行降维。在3D对象分类基准ModelNet上的实验结果显示，PointGCN的性能表现优秀，与现有的竞争方案相比具有更高的稳定性和准确性。Index Terms包括：图卷积神经网络，这表明该研究重点探讨了如何将图卷积的概念应用于3D点云的深度学习任务中。通过Graph-CNNs，研究人员能够探索点云的局部结构，并通过学习这些结构的表示来识别和分类不同的3D形状。此外，点云的图下采样操作进一步提高了模型的效率，使得在大型点云数据集上训练和推理成为可能。这篇论文展示了Graph-CNNs在处理3D点云数据时的独特优势，为未来在3D计算机视觉领域的深度学习研究提供了新的思路和方法。通过PointGCN的架构，可以更有效地挖掘点云数据中的拓扑信息，实现高精度的3D对象分类。这不仅有助于提升现有3D识别技术的性能，也为自动驾驶、虚拟现实、工业检测等实际应用提供了强大的工具。

A GRAPH-CNN FOR 3D POINT CLOUD CLASSIFICATION

Yingxue Zhang and Michael Rabbat

McGill University

Montreal, Canada

ABSTRACT

Graph convolutional neural networks (Graph-CNNs) extend tradi-

tional CNNs to handle data that is supported on a graph. Major chal-

lenges when working with data on graphs are that the support set (the

vertices of the graph) do not typically have a natural ordering, and in

general, the topology of the graph is not regular (i.e., vertices do not

all have the same number of neighbors). Thus, Graph-CNNs have

huge potential to deal with 3D point cloud data which has been ob-

tained from sampling a manifold. In this paper we develop a Graph-

CNN for classifying 3D point cloud data, called PointGCN

. The

architecture combines localized graph convolutions with two types

of graph downsampling operations (also known as pooling). By

the effective exploration of the point cloud local structure using the

Graph-CNN, the proposed architecture achieves competitive perfor-

mance on the 3D object classiﬁcation benchmark ModelNet, and our

architecture is more stable than competing schemes.

Index Terms— Graph convolutional neural networks, graph

signal processing, 3D point cloud data, supervised learning

1. INTRODUCTION

With the advent of very large datasets and improved computational

capabilities, methods using convolutional neural networks (CNNs)

now achieve state-of-the-art performance on a variety of tasks, in-

cluding speech recognition and image classiﬁcation. Many emerging

applications give rise to data that may be viewed as being supported

on the vertices of a graph, and ﬁeld of graph signal processing (GSP)

has developed ﬁltering and other operations on graph signals [1, 2].

Data may either be naturally sampled on the vertices or edges of a

graph (e.g., ﬂows on a transportation network), or the data may sim-

ply be unstructured and a graph is imposed to capture the manifold

structure underlying the data (e.g., the 3D point clouds considered

in this paper). Unlike the domains encountered in more traditional

signal processing (e.g., 1D time-series, 2D images), general graph

topologies do not have the same regularity or symmetries, and so

there is not a unique, well-deﬁned notion of convolution on a graph.

This has motivated researchers to develop a variety of approaches to

convolutions on graphs, which can then be applied in graph-CNNs

and other graph-based signal processing architectures.

Bruna et al. [3, 4] ﬁrst proposed the idea of using a graph con-

volution deﬁned in the graph spectral domain together with a graph

multiresolution clustering approach to achieve pooling/downsampling.

Defferrard et al. [5] propose a fast localized convolution operation

by leveraging the recursive form of Chebyshev polynomials to both

avoid explicitly calculating the Fourier graph basis and to allow the

number of learnable ﬁlter coefﬁcients to be independent of the graph

Code is available at https://github.com/maggie0106/

Graph-CNN-in-3D-Point-Cloud-Classification

size. Atwood and Towsley [6] use a similar localized ﬁltering idea

but deﬁne the convolution process directly in the spatial domain by

searching the receptive ﬁled at different scales using random walk.

Graph kernels have also been applied to the graph classiﬁcation

task [7, 8] which aims to classify a graph based on its topology, as

opposed to classifying or otherwise processing signals on a graph.

However, Graph kernels suffer from quadratic training complexity

in the number of graphs [8].

The formulation of Graph-CNNs opens up a range of applica-

tions. Defferrard et al. [5] validate their model on an image classiﬁ-

cation task and demonstrate the effectiveness of Graph-CNNs. Kipf

and Welling [9] study the application of the Graph-CNNs to semi-

supervised learning. In this paper, we explore the application of the

Graph-CNNs in 3D point cloud data.

GSP techniques have been applied to process 3D point cloud

data, such as that obtained by light detection and ranging (LiDAR)

sensors. Rather than binning point clouds into voxels, graph-based

approaches ﬁt a graph with one vertex for each point and edges be-

tween nearby points, and then operate on the graph. The effective-

ness of GSP for processing 3D point cloud data has been demon-

strated in applications such as data visualization, in-painting, and

compression [10, 11, 12, 13].

In this work, we propose a Graph-CNN architecture called

PointGCN for classifying 3D point cloud data by exploring its local

structure encoded in the constructed graph. Unlike most previous

Graph-CNNs, in this setting both the signals and the graph structure

vary from input to input. The proposed architecture uses existing

graph convolution operation together with two types of speciﬁ-

cally designed pooling layers for point cloud data. The architecture

learns a latent signature summarizing each point cloud at different

receptive ﬁelds.

We achieve an average classiﬁcation accuracy comparable to the

state-of-the-art on the ModelNet benchmark, and the variance of the

proposed approach is substantially lower than existing point-based

classiﬁcation methods.

2. PROBLEM STATEMENT

We consider a classiﬁcation problem where we are given m labeled

training instances {(X

, y

)}, each composed of an input X

∈ X

and an output y

∈ Y. Our goal is produce a function y = f(X)

to predict the output y associated with a new, unseen input X. For

point-based 3D classiﬁcation problem, we consider the case where

the output space Y is ﬁnite (the classes), and each input X

is a set

of n points, {x

j,1

, . . . , x

j,n

} ⊂ R

Previous work has taken different approaches to classifying 3D

point clouds [14, 15, 16, 17, 18, 19, 20, 21], including rendering and

processing a collection of 2D images (projections of the points onto

an image plane from different perspectives), or binning the points

arXiv:1812.01711v1 [cs.CV] 28 Nov 2018

下载后可阅读完整内容，剩余4页未读，立即下载

普通网友

粉丝: 1263
资源:
5619

点云分类的图卷积神经网络研究

nebula-graph-3.2.0.el7.x86_64

neo4j-graph-data-science-1.6.1-standalone.zip

CIKM2019-graph-for-recommendation.pdf

nebula-graph-3.8.0.el7.x86-64.tar.gz

python3-subunit2sql-graph-1.9.0-3.el8.noarch.rpm

js-d3-flame-graph-4.0.7-1.el8.noarch.rpm

openstack-vitrage-graph-5.0.1-1.el8.noarch.rpm

openstack-vitrage-graph-5.0.0-1.el7.noarch.rpm

openstack-vitrage-graph-5.0.1-1.el7.noarch.rpm

boost-graph-1.66.0-10.el8.aarch64.rpm

最新资源