nonparametric method. The method is based on pixel-
wise segmentation and the statistical model for pose
estimation, which is susceptible to independent non-
rigid body motion of past frames, and therefore some
dynamic pixels may be included in the energy function
minimization. Meanwhile, Scona et al. [24] proposed
StaticFusion, which maintains a map of the static background and uses it for pose estimation in a model-to-frame manner. However, StaticFusion cannot handle scenes with fast camera motion because there is not enough time to update the map. Finally, Sun et al.
[25] proposed a motion removal approach as a preprocessing step and integrated it into the front end of RGB-D SLAM. However, this method can handle only a single moving object rather than multiple moving objects in dynamic scenes.
Another line of related work consists of off-line methods.
Roussos et al. [26] proposed an approach of multi-
body motion segmentation and reconstruction based
on the energy function. The algorithm effectively
gives the camera pose, scene depth, and 3D recon-
struction in dynamic scenes. Unfortunately, the method processes RGB-D data in batches and is therefore an off-line system. Wang et al.
[27] estimated dense optical flow from frames, where
dynamic objects can be excluded by clustering motion
patterns based on optical flow. However, owing to the heavy computational cost, this method could not achieve real-time performance.
Regarding motion segmentation, Stückler et al. [28]
proposed an efficient real-time dense motion segmentation method, whose weakness is that it applies only to rigid-body segmentation. Although some unsupervised learning-based methods [29–31] have recently been proposed and achieved good results, they do not always perform well in unfamiliar dynamic scenes, since they require a large dataset to train the network; thus, they suffer from a generalization problem.
3 Dense Visual Odometry Approach
3.1 Overview
In this paper, we propose a visual odometry approach based on a nonparametric statistical model and a clustering model. An overview is shown in Fig. 1. First, K-means clustering is used to segment each frame into N clusters based on depth and intensity. Each cluster is treated as a rigid body, so the pixel-wise motion segmentation problem is reduced to cluster-wise segmentation.
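To make this step concrete, the following is a minimal sketch of cluster-wise segmentation by K-means on per-pixel intensity and depth. The feature scaling, the number of clusters, and the use of scikit-learn's KMeans are illustrative assumptions rather than the paper's exact configuration.

```python
import numpy as np
from sklearn.cluster import KMeans

def segment_frame(intensity, depth, n_clusters=24):
    """Cluster pixels by (intensity, depth) so that each cluster can be
    treated as an approximately rigid region.

    intensity, depth: HxW float arrays (depth in meters, 0 = invalid).
    Returns an HxW label image; pixels with invalid depth get label -1.
    """
    h, w = depth.shape
    valid = depth > 0
    # Per-pixel feature vector; scaling the two channels is an
    # illustrative choice to balance their influence.
    feats = np.stack([intensity[valid] / 255.0, depth[valid]], axis=1)
    km = KMeans(n_clusters=n_clusters, n_init=4, random_state=0).fit(feats)
    labels = np.full((h, w), -1, dtype=np.int32)
    labels[valid] = km.labels_
    return labels
```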
Second, the initial camera pose is estimated by minimizing photometric and depth residuals under a Cauchy M-estimator, and the estimated pose is then used to warp previous frames into the current frame's coordinate system. After regularization, the warped frames are used to compute a temporal consistency residual for each cluster, which enforces the continuity of cluster motion. Third, the temporal consistency residuals are used to build a nonparametric statistical model based on the t-distribution, and moving objects are identified by means of a dynamic threshold condition.
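As a rough sketch of how such a statistical model could yield per-cluster confidences and a dynamic threshold, the fragment below fits the scale of a t-distribution to the temporal consistency residuals and derives a weight and a moving/static decision per cluster. The degrees of freedom, the fixed-point scale estimate, and the threshold rule are assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

def t_dist_scale(residuals, nu=5.0, iters=10):
    """Estimate the scale of a zero-mean t-distribution fitted to the
    residuals by fixed-point iteration (a standard weighted-variance
    update); nu and the iteration count are illustrative."""
    sigma2 = np.mean(residuals ** 2) + 1e-12
    for _ in range(iters):
        w = (nu + 1.0) / (nu + residuals ** 2 / sigma2)
        sigma2 = np.mean(w * residuals ** 2) + 1e-12
    return np.sqrt(sigma2)

def cluster_confidences(residuals, labels, n_clusters, nu=5.0, thresh=2.0):
    """Per-cluster weight (confidence) from t-distribution weights, plus a
    simple dynamic threshold: clusters whose median absolute residual
    exceeds thresh * sigma are flagged as moving (thresh is an assumption)."""
    sigma = t_dist_scale(residuals, nu)
    w = (nu + 1.0) / (nu + (residuals / sigma) ** 2)
    conf = np.zeros(n_clusters)
    moving = np.zeros(n_clusters, dtype=bool)
    for c in range(n_clusters):
        mask = labels == c
        if mask.any():
            conf[c] = w[mask].mean()
            moving[c] = np.median(np.abs(residuals[mask])) > thresh * sigma
    return conf, moving
```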
Finally, the probability confidence of each static cluster, derived from the statistical model, is used as a weight in the energy function optimization to obtain a more accurate camera pose estimate. Afterward, the warp function is updated with the newly estimated transformation for the next iteration.
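To illustrate how a Cauchy M-estimator and the per-cluster static confidences might enter the optimization, the sketch below evaluates one iteratively reweighted least-squares surrogate of the robust energy. The scale parameter c and the multiplicative combination of the two weights are assumptions for illustration, not the paper's exact energy function.

```python
import numpy as np

def cauchy_weights(residuals, c=1.0):
    """Cauchy M-estimator weight w(r) = 1 / (1 + (r / c)^2); large
    residuals (likely from dynamic pixels) are strongly down-weighted."""
    return 1.0 / (1.0 + (residuals / c) ** 2)

def weighted_energy(residuals, labels, static_conf, c=1.0):
    """IRLS surrogate of the robust energy: squared residuals scaled by
    the Cauchy weight and by the static confidence of each pixel's cluster."""
    valid = labels >= 0                    # ignore pixels without a cluster
    r = residuals[valid]
    w = cauchy_weights(r, c) * static_conf[labels[valid]]
    return np.sum(w * r ** 2)
```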
3.2 Preliminaries
Since the RGB-D sensor simultaneously provides a color image and a depth image, a pair of frames $(I_{k-1}, Z_{k-1})$ and $(I_k, Z_k)$ is given as input, where $I(\mathbf{x}) \in \mathbb{R}$ and $Z(\mathbf{x}) \in \mathbb{R}$ represent the intensity and depth, respectively, of pixel $\mathbf{x} = (u, v)^T \in \mathbb{R}^2$. Intensity is converted from the color image as $0.299R + 0.587G + 0.114B$. In homogeneous coordinates, given a 3D point $\mathbf{p}_k = (X_k, Y_k, Z_k, 1)^T$, the projection function and its inverse between the 3D point and its pixel on the image are as follows:
$$\mathbf{x} = \pi(\mathbf{p}_k) = \left( \frac{X_k f_x}{Z_k} + o_x,\; \frac{Y_k f_y}{Z_k} + o_y \right) \qquad (1)$$
$$\mathbf{p}_k = \pi^{-1}(\mathbf{x}, Z_k) = \left( \frac{(u - o_x)\, Z_k}{f_x},\; \frac{(v - o_y)\, Z_k}{f_y},\; Z_k,\; 1 \right) \qquad (2)$$
where $f_x$ and $f_y$ are the focal lengths and $(o_x, o_y)$ is the principal point.
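Equations (1) and (2) translate directly into code. Below is a small sketch of the pinhole projection and back-projection, assuming the intrinsics $(f_x, f_y, o_x, o_y)$ are passed explicitly; the function names are illustrative.

```python
import numpy as np

def project(p, fx, fy, ox, oy):
    """Eq. (1): project a 3D point p = (X, Y, Z, ...) onto the image plane."""
    X, Y, Z = p[0], p[1], p[2]
    u = fx * X / Z + ox
    v = fy * Y / Z + oy
    return np.array([u, v])

def back_project(u, v, Z, fx, fy, ox, oy):
    """Eq. (2): lift pixel (u, v) with depth Z to a homogeneous 3D point."""
    X = (u - ox) * Z / fx
    Y = (v - oy) * Z / fy
    return np.array([X, Y, Z, 1.0])
```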
As the camera moves, the 3D point $\mathbf{p}$ in the previous frame's camera coordinate system can be rigidly transformed to the current frame with the transformation matrix $T_{k-1}^{k} \in SE(3)$. The new coordinate of the 3D point in the current camera frame can be obtained by the following function: