点云中多视图3D物体识别的快速与稳健方法

需积分: 10 34 浏览量更新于2024-09-09 1 收藏 5.8MB PDF 举报

"本文主要探讨了多视图3D物体识别在点云数据中的应用，提出了一种新的、快速且稳健的方法，通过将3D点云投影到多个2D深度图像，将3D识别问题转化为一系列2D检测问题，从而简化了处理复杂度，提升了性能并加快了识别速度，无需预先进行对象分割或检测器训练。实验表明，该方法在工业和街道数据扫描的实例上优于多种现有技术。" 在3D物体识别领域，点云数据的处理一直是一项挑战，这主要是由于数据的离散采样、遮挡等因素导致的。传统的3D物体识别方法通常需要预先进行物体分割和3D描述符的训练与匹配，这些过程既耗时又复杂，尤其是在处理大规模的工业或城市街景数据时更为明显。针对这一问题，本文提出了一个新的多视图3D物体识别策略。该策略的核心在于将3D点云数据投影到多个不同的2D视图中，生成一系列的深度图像。这样做可以将原本复杂的3D空间识别问题转换为相对简单的2D图像检测任务，从而降低了计算复杂性，提高了识别的稳定性，并显著提升了识别速度。值得注意的是，这种方法不依赖于对象分割或者专门的检测器训练，使得整个流程更加简洁高效。实验部分，作者对比了他们的方法与其他几种最先进的技术在工业和街道环境的点云数据上的表现，验证了新方法的优越性。这些实验结果进一步证实了多视图策略在处理3D点云数据识别问题上的优势，特别是在应对遮挡、噪声和大规模数据集时的鲁棒性。这项工作为3D物体识别提供了一个新的视角，通过多视图处理简化了点云数据的处理，为未来的研究和实际应用提供了重要的参考。这种方法可能对自动驾驶、机器人导航、无人机监测等领域具有广泛的应用潜力，因为它能够快速准确地识别复杂环境中的3D物体，从而提高系统的决策能力和安全性。

Fast and Robust Multi-View 3D Object Recognition in Point Clouds

Guan Pang

University of Southern California

gpang@usc.edu

Ulrich Neumann

University of Southern California

uneumann@usc.edu

Abstract

Recognition of three dimensional (3D) objects in point

clouds is a challenging problem. Existing methods often

require prior segmentation or 3D descriptor training and

matching, both time consuming and complex processes,

especially for large-scale industrial or urban street data.

We describe a new recognition approach that projects a

3D point cloud into several 2D depth images from multiple

viewpoints, transforming the 3D recognition problem into

a series of 2D detection problems. This method reduces

complexity, stabilizes performance, and signiﬁcantly speeds

up the recognition process, without any requirement for

object segmentation or detector training. Experiments

validate the superiority of our method over several state-

of-the-art methods on examples from industrial and street

data scans.

1. Introduction

3D object recognition (ﬁg. 1) in point clouds is a chal-

lenging problem due to discrete sampling, occlusions and

cluttered scenes. Many existing methods focus on small-

scale data [1, 2, 3, 4, 5, 6, 7, 8] using 3D descriptors. A

few others work with large-scale data, mostly urban street

scans [9, 10, 12, 13, 14, 15]. These methods utilize machine

learning to select the best description for a speciﬁc type

of 3D object, so they can be recognized reliably in a large

urban scene, and usually require prior segmentation of input

data. Relatively fewer take on industrial part recognition[14,

15], where objects are often more densely arranged, making

segmentation more difﬁcult. Regardless of domain focus,

most methods perform the recognition process in 3D, either

using 3D local descriptors [1, 9, 16, 2, 3, 4] or exhaustive3D

scanning-window search [14, 17]. Both approaches require

3D descriptor or detector training and are time-consuming

due to the 3-dimensional search. Large-scale industrial

or street data contain 100’s of millions or billions of 3D

points, motivating the search for fast and robust recognition

methods.

Two recent trends motivate our work. Growing avail-

Figure 1. Object recognition from 3D point cloud.

ability and use of 3D scanners has spurred interest in 3D

object recognition. Also, 2D object detection in images

has improved dramatically. These observations motivate a

transformation of the 3D object recognition problem into a

series of 2D detection problems. This 3D-to-2D strategy

is similar to those used for 3D object model retrieval [20,

21, 23], but our target is unsegmented noisy large-scale 3D

point cloud which is much more complex. Our algorithm

for 3D object recognition is based on multi-view projection,

ﬁrst projecting a 3D point cloud into 2D depth images from

multiple viewpoints. Objects are detected in each view

using gradient data, and the 2D detection results are fused

by 3D re-projection to determine object locations. This

algorithm reduces the search complexity from 3D to 2D,

while removing all requirements for object segmentation

or detector training. The multi-view projection process

also stabilizes performance in cluttered and occluded scenes

and provides rotation invariance. Our method is tested on

a combination of industrial data and street data [25, 13]

containing various types of objects and scene conditions. In

comparisons with state-of-the-art 3D recognition methods,

our method has competitive overall performance with one-

order of magnitude speed-up.

Our main contributions include:

• Transforming the 3D point cloud object recognition

problem into a series of 2D detection problems to

reduce search complexity.

• Employing multi-view projection to provide rotation

invariance and stabilize performance in cluttered and

occluded scenes.

下载后可阅读完整内容，剩余8页未读，立即下载

铮铭

粉丝: 76
资源: 14

点云中多视图3D物体识别的快速与稳健方法

点云中快速稳健的多视图3D物体识别

复现Multi-Objective Matrix Normalization：模型加载与精度探讨

"深度人脸识别综述：最新进展与关键元素

Multi-Scale Categorical Object Recognition Using Contour Fragments

CIFAR-10 - Object Recognition in Images-数据集

用于排课的matlab代码-View-Sequence-based-3D-object-recognition:论文“为3D形状识别深度开发长

Generative-Multi-View-Human-Action-Recognition:生成式多视图人类动作识别

ios-caffe-ObjectRecognition:对象识别演示应用

Human-level Moving Object Recognition from Traffic Video

Three-Dimensional-Object-Recognition-and-6-DoF-Pose-Estimation-C

最新资源