RGB-D相机在移动机器人中的人员检测与跟踪技术

181 浏览量更新于2024-08-26 收藏 1.51MB PDF 举报

"使用RGB-D相机对移动机器人进行人员检测和跟踪" 这篇研究论文探讨了如何利用RGB-D相机实现移动机器人的人员检测和跟踪，这是自然人机交互的关键能力。文章提出了一套系统，该系统包括人体检测、跟踪和再识别三个主要部分，旨在提升机器人在复杂环境中的感知和互动效能。首先，为了减少计算负担，系统通过去除地面点和天花板平面来预处理数据。这一步是通过一种基于先验知识的随机样本共识（RANSAC）拟合算法实现的，该算法能够有效地检测到地面平面和天花板点。接着，所有剩余的点被投影到地面平面上，并进一步分割成子聚类，作为候选的检测对象。采用Epanechnikov核的均值漂移聚类方法对这些点进行分区，将不同的点聚集到各自的子聚类中。论文中提出了一个创新的概念——空间兴趣区域平面视图地图，用于从点云子聚类中识别出人类候选者。通过在线提取深度加权直方图，这个方法能够帮助区分出可能是人体的特征区域，提高了检测的准确性。然后，对于人体跟踪，论文可能采用了某种基于卡尔曼滤波或者粒子滤波的算法，这些算法能够持续追踪检测到的人体并预测其未来的运动状态，即使在短暂遮挡或目标离开视线后也能重新识别。此外，论文还可能涉及了鲁棒性处理，如多传感器融合，以应对RGB-D数据的噪声和不准确性。这可能包括对深度图像的预处理，以及与其他传感器（如激光雷达或惯性测量单元）的数据融合，以提高整体系统的稳定性和可靠性。最后，作者们进行了实验验证，通过实际场景下的测试数据来评估所设计系统的性能。实验结果可能展示了在不同环境条件和人群密度下，该系统在人体检测和跟踪方面的准确性和实时性。这篇研究论文为移动机器人的人体检测和跟踪提供了一个有效且实用的方法，对于智能服务机器人、安防监控等领域具有重要的应用价值。通过结合颜色信息和深度信息，RGB-D相机的潜力被充分挖掘，为实现更加智能和自主的机器人行为奠定了基础。

Contributions

Our human perception method combines a set of novel

techniques to create a system that is capab le of trac ki ng

multiple human targets and rejecting nonhuman objects

from a mobile robot. What’s more, our perception system

is robust to occlusion, illustration changes, and unpre-

dicted motion patterns. The main contributions of this

article include:

1. the introduction of a new idea using meanshift clus-

tering candidate se gmentation for plan view map

generation from a RGB-D camera, which allows

us to avoid using the noisy point cloud for compu-

tationally expensive plan view map generation, aug-

ment detection precision, and speed up to achieve

real-time performance;

2. the use of point cloud preprocessing, where planes,

cylinders, or other regular objects are removed to

lower the false positive ratio, followed by tracking-

by-detection over a 3D point cloud that associates

motion tracking and object detection which can

extensively be applied to HRI.

The remainder of the article is organized as follows. The

second section overviews the related literature of human

perception. The third section depicts our approaches to

detect and track multiple humans in preprocessed 3D point

clouds. Experimental results are presented in the fourth

section. Finally, the conclusion of this article is given in

the fifth section.

Related work

To achieve natural human perception in crowded human

zones, a large number of human detection and tracking

approaches have been investigated. Using a consumer-

grade camera is cost-efficient, so it is widely adopted in

human detection and tracking. To detect and track people in

the real world from a moving camera, great efforts have

been made. A probabilistic framework was proposed

detect multiple people in a crowded scene by combining

multiple detectors. By combining multiple detectors, the

Reversible Jump Markov Chain Monte Carlo particle filter-

ingmethodwasadoptedtofindmaximumaposteriori

probability (MAP) of a posterior probability to track people

in a single coherent framework. Mekonnen et al.

designed

a cooperative perception system made up of wall mounted

cameras and a mobile r obot t o perceive passers- by and

obtain their positions and trajectories. Jia et al.

presented

a visual human tra cking approach based on a meanshift

algorithm. In their implementation, color and texture histo-

grams were integrated into a meanshift tracker under the

double-layer locating mechanism. The Histogram of

Oriented Gradient (HOG),

also known as the Dalal–Triggs

detector, was introduced to localize people utilizing a slid-

ing window and support vector machines (SVM) to discri-

minate people from others. A drawback of using a single

camera is that occlusion causes a false negative.

What’s more, a legTracker

was proposed to detect and

track human legs by the application of the support vector

data description scheme using measurement from a laser

range finder. In addition, networks of laser range finders

were calibrated to determine the positions of pedestrians,

which enabled pedestrian tracking within 11 cm accu-

racy.

But these laser range finder based human detection

and tracking systems provide only partial depth informa-

tion about a single plane.

3D sensors, such as a 3D-laser, 3D rotating Lidar,

stereo ca mera, ToF camer a, and RGB-D camera, can pro-

vide 3 D position i nforma tion and spatial geometric con-

straints of a human. W ith the assistance o f such 3D spatial

information, the r obot knows how people move about in

the surrounding environment. Depth sensing technology

assisted human dete ction and tracking systems have also

been extensively discuss ed.

1. 3D-lasers. Spinello et al.

proposed a novel

approach for pedestrian detection in a 3D range data

Figure 1. Overview of our multiple human detection and tracking system. Starting with the input Point Cloud Data (PCD) point cloud,

the system: (1) detects the ground and ceiling planes and removes them; meanwhile, a prior-knowledge guided random sample

consensus (RANSAC) is used to fit the ground plane; (2) projects all points onto the ground plane, and applies a meanshift clustering

algorithm to segment candidates for generating plan view maps; (3) associates motion and detection data for multiple human object

tracking. Our tracking results are demonstrated using a bounding box in which a human is tracked.

Liu et al. 3

剩余10页未读，继续阅读

weixin_38696336

粉丝: 3
资源: 921

RGB-D相机在移动机器人中的人员检测与跟踪技术

论文研究-基于RGB-D相机的室内环境3D地图创建.pdf

matlab的代码在相机上实现-peopleTracker:对RGB-D数据进行人员检测和跟踪

基于激光与RGB-D相机的异构多机器人协作定位.pdf

一种基于RGB-D的移动机器人未知室内环境自主探索和地图构建方法

人工智能-机器学习-移动机器人目标识别及跟踪搬运策略研究.pdf

室内移动机器人RGB-D+SLAM算法研究1

RGB-D相机驱动的移动机器人三维地图构建与定位方法

移动机器人三维地图创建：基于RGB-D相机的混合位姿估计方法

带有RGB-D相机的DENSE FRAME-TO-MODEL SLAM

基于RGB-D相机的SLAM技术研究综述

最新资源