PointNetGPD: Detecting Grasp Configurations from Point Sets
Hongzhuo Liang¹†, Xiaojian Ma²†, Shuang Li¹, Michael Görner¹, Song Tang¹, Bin Fang², Fuchun Sun²∗, Jianwei Zhang¹
Abstract— In this paper, we propose an end-to-end grasp
evaluation model to address the challenging problem of localizing robot grasp configurations directly from the point cloud.
Compared to recent grasp evaluation metrics that are based on
handcrafted depth features and a convolutional neural network
(CNN), our proposed PointNetGPD is lightweight and can directly process the 3D point cloud that lies within the gripper for grasp evaluation. Taking the raw point cloud as input, our
proposed grasp evaluation network can capture the complex
geometric structure of the contact area between the gripper
and the object even if the point cloud is very sparse. To further
improve our proposed model, we generate a larger-scale grasp dataset with 350k real point clouds and grasps on the YCB object set for training. The performance of the proposed model
is quantitatively measured both in simulation and on robotic
hardware. Experiments on object grasping and clutter removal
show that our proposed model generalizes well to novel objects
and outperforms state-of-the-art methods. Code and video are
available at https://lianghongzhuo.github.io/PointNetGPD.
I. INTRODUCTION
Planning a grasp under uncertainty is a difficult task
in robotics. For a robot that operates in the real world,
uncertainty may arise from various sources. In this paper, we mainly concentrate on the uncertainty brought by imprecise and deficient sensing. Such uncertainty is usually associated with the sensor used for robotic perception [1]. To address this problem, a grasping model
that can work with raw sensor input is needed. Some recent
advances suggest using deep neural networks trained on large-scale grasp datasets labeled by humans [2], [3] or by grasping outcomes collected on robotic hardware [4], [5] to plan grasps directly from sensor input such as images [6] or point clouds [7]. Such work yields promising results across a wide variety of objects, sensors, and robots, and the resulting models generalize well to novel objects that are not
present in the training set. However, most of the current
methods still rely on 2D (image) or 2.5D (depth map) input;
some grasping models even require complex hand-crafted
features [8] before they can process the data, while very few take 3D geometric information into consideration [9]. Intuitively, whether a grasp is successful
or not is always related to how the robot (gripper) interacts
with the object surface in 3D space; thus the lack of geometry analysis could entail side effects in grasp planning, especially when accurate and complete sensing is not available.
†These two authors contributed equally. This work was done when Hongzhuo Liang was visiting Tsinghua University.
¹TAMS (Technical Aspects of Multimodal Systems), Department of Informatics, Universität Hamburg
²Tsinghua National Laboratory for Information Science and Technology (TNList), State Key Lab on Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University
∗Corresponding author, e-mail: fcsun@tsinghua.edu.cn
[Fig. 1 panels: Robot Initial State, Grasp Candidates Generation, Grasp Dataset, Quality Evaluation with PointNet, Best Grasp, Executed Grasp.]
Fig. 1. An illustration of our proposed PointNetGPD for detecting reliable grasp configurations from point sets. Taking raw sensor input from a common RGB-D camera, we first convert the depth map into a point cloud; then several grasp candidates are sampled using essential geometry information as heuristics or constraints. For each candidate, the point cloud within the gripper is cropped, transformed into the local gripper coordinates, and finally fed into our grasp quality evaluation network. The grasp with the highest score is executed. Our model is trained on a large-scale grasp dataset based on the YCB [10] object set.
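As a rough sketch of this pipeline (not our actual implementation), the following Python pseudo-implementation scores every sampled candidate with a point-set quality network and keeps the best one; the candidate sampler, the gripper-frame cropping, and the quality network are passed in as callables because their concrete implementations are omitted here, and all names are illustrative:

import numpy as np

def plan_best_grasp(cloud, sample_candidates, crop_to_gripper, quality_net):
    """Pick the highest-scoring grasp among sampled candidates (sketch only).

    cloud:             (N, 3) array of points converted from the depth map.
    sample_candidates: callable, cloud -> list of 6D grasp candidate poses.
    crop_to_gripper:   callable, (cloud, grasp) -> points inside the gripper
                       closing region, expressed in the gripper's local frame.
    quality_net:       callable, point set -> scalar grasp quality score.
    """
    best_grasp, best_score = None, -np.inf
    for grasp in sample_candidates(cloud):
        local_points = crop_to_gripper(cloud, grasp)
        if len(local_points) == 0:
            continue  # the gripper would close on empty space; skip
        score = float(quality_net(local_points))
        if score > best_score:
            best_grasp, best_score = grasp, score
    return best_grasp  # the grasp that the robot executes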
To tackle these unsolved issues, inspired by the recent
work of PointNet [11] that directly operates on point sets
for 3D object classification and segmentation, in this work,
we propose a method for detecting reliable grasp configurations directly from the point cloud.
As illustrated in Figure 1, PointNetGPD provides an effective
pipeline to generate and evaluate grasp configurations. Compared with previous grasp detection methods that depend on multi-view CNNs [8] or 3D CNNs [12], our approach does not require projecting the point cloud onto multiple 2D images or rasterizing it into dense 3D volumes. As a result, it preserves most of the geometric information in the original point cloud and infers grasp quality more efficiently.
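For illustration, a PointNet-style evaluator could be as small as the PyTorch sketch below: a shared per-point MLP followed by symmetric max pooling consumes the raw points inside the gripper closing region without any projection or voxelization. The layer sizes are placeholders and do not reproduce our exact architecture.

import torch
import torch.nn as nn

class GraspQualityNet(nn.Module):
    """PointNet-style grasp quality evaluator (illustrative sketch)."""

    def __init__(self, num_classes: int = 2):
        super().__init__()
        # Shared per-point MLP, implemented as 1x1 convolutions over the points.
        self.point_mlp = nn.Sequential(
            nn.Conv1d(3, 64, 1), nn.ReLU(),
            nn.Conv1d(64, 128, 1), nn.ReLU(),
            nn.Conv1d(128, 1024, 1), nn.ReLU(),
        )
        # Classifier on the pooled, order-invariant global feature.
        self.head = nn.Sequential(
            nn.Linear(1024, 256), nn.ReLU(),
            nn.Linear(256, num_classes),
        )

    def forward(self, points: torch.Tensor) -> torch.Tensor:
        # points: (batch, 3, num_points) in the local gripper frame
        per_point = self.point_mlp(points)            # (batch, 1024, num_points)
        global_feature = per_point.max(dim=2).values  # symmetric max pooling
        return self.head(global_feature)              # (batch, num_classes) logits

Because the max pooling is permutation invariant, such an evaluator is insensitive to point ordering and tolerates moderate sparsity in the cropped cloud.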
Recent success in deep neural network based grasp detection methods [3], [6] emphasizes the importance of training
on large-scale datasets. To further improve the performance
of the proposed grasp detection method, we build a grasp dataset with 350k real point clouds captured by depth cameras, parallel-jaw grasps, and analytic grasp metrics over
a subset of the YCB [10] object set. Different from other
grasp datasets like Dex-Net [3], we provide fine-grained
scores for each grasp instead of binary labels. Specifically,
given a 6D grasp pose and the CAD model of an object, we perform a force-closure [13] analysis and a frictionless grasp wrench space (GWS) [14] analysis on the grasp to obtain these scores. Quantitative scores make more flexible label assignment possible during training, which could also
improve the performance of our grasp quality evaluation network.
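As a sketch of what such flexible label assignment could look like (the thresholds and the simple averaging of the two metrics below are illustrative assumptions, not our published labeling scheme), continuous quality scores can be binned into discrete training labels whose boundaries are adjustable without regenerating the dataset:

import numpy as np

def scores_to_labels(force_closure_scores, gws_scores, thresholds=(0.4, 0.6)):
    """Convert per-grasp quality scores into discrete training labels (sketch)."""
    # Combine the two quality metrics; the combination rule is a placeholder.
    quality = 0.5 * (np.asarray(force_closure_scores, dtype=np.float64)
                     + np.asarray(gws_scores, dtype=np.float64))
    labels = np.zeros(quality.shape, dtype=np.int64)  # 0 = poor grasp
    labels[quality >= thresholds[0]] = 1              # 1 = acceptable grasp
    labels[quality >= thresholds[1]] = 2              # 2 = robust grasp
    return labels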