2. We extend the state-of-the-art ElasticFusion [20] to a multi-camera system to achieve better dense RGB-D SLAM.
Figure 1. Example of three-Kinect arrangement.
2. Extrinsic Calibration of Multiple Cameras
2.1. Odometer-Based Extrinsic Calibration
We run RGB-D visual odometry (VO) for each camera in a feature-rich scene to estimate a set of
camera poses which is required for the subsequent step of hand–eye calibration. Our RGB-D VO
method is similar to [21], which is the classical VO method for RGB-D SLAM. We perform a dense
iterative closest point (ICP) method to estimate the camera pose, using a projective data association
algorithm [22] to obtain correspondences and a point-to-plane error metric for pose optimization.
Then we solve the optimization problem using the GPU’s parallelized processing pipeline. The
point-to-plane error energy for the desired camera pose estimate T is
E = \sum_{u \in \Omega} \left( \left( T v_k(u) - v_{k-1}(u) \right) \cdot n_{k-1} \right)^2 .    (1)
We track the current camera frame by aligning a live surface measurement $(v_k, n_k)$ against the
model prediction from the previous frame $(v_{k-1}, n_{k-1})$, where $\Omega \subset \mathbb{N}^2$ is the
image space domain, $v$ is a vertex, $n$ is a normal, and $k$ is the timestamp. With the VO method,
we obtain a set of camera poses.
Then we use the hand–eye calibration method of [7] to estimate each camera-odometry
transformation. The unknown camera-odometry transformation is estimated in two steps. In the first
step, the rotation cost function is minimized to estimate the pitch and roll angles of the camera-
odometry transformation. In the second step, the translation cost function is minimized to estimate
the yaw angle and the camera-odometry translation. The relationship between the camera and the robot
can be expressed by a rotation equation and a translation equation:
{}^{R_{i+1}}_{R_i}q \, {}^{R}_{C}q = {}^{R}_{C}q \, {}^{C_{i+1}}_{C_i}q ,    (2)

\left( R\!\left({}^{R_{i+1}}_{R_i}q\right) - I \right) {}^{R}_{C}p = R\!\left({}^{R}_{C}q\right) {}^{C_{i+1}}_{C_i}p - {}^{R_{i+1}}_{R_i}p .    (3)
In the above, the rotation is represented by a quaternion, and the translation by a vector. The robot’s
transformation between time i and time i + 1 is denoted by the vector ${}^{R_{i+1}}_{R_i}p$ and the unit
quaternion ${}^{R_{i+1}}_{R_i}q$, which can be obtained from the robot’s inertial measurement unit.
${}^{C_{i+1}}_{C_i}p$ and ${}^{C_{i+1}}_{C_i}q$ represent the camera’s transformation between time i and
time i + 1, which can be obtained by the above VO method. ${}^{R}_{C}p$ and ${}^{R}_{C}q$ represent the
transformation between the robot and the camera. In the first step, we decompose the unknown unit
quaternion ${}^{R}_{C}q$ into three unit quaternions corresponding to Z–X–Y Euler angles α, β, and γ, as
{}^{R}_{C}q = q_z(\gamma) \, q_{xy}(\alpha, \beta) .    (4)
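As a rough, self-contained sketch of the quaternion algebra in Equations (2)–(4), the NumPy snippet below composes the camera-odometry rotation from a yaw quaternion and a pitch/roll quaternion, and checks that two rotations about the z axis commute, the property used next to simplify Equation (2). The helper functions, the [w, x, y, z] convention, and the angle values are our own illustrative assumptions, not taken from [7].

```python
import numpy as np

def quat_about(axis, angle):
    """Unit quaternion [w, x, y, z] for a rotation of `angle` about axis 0=x, 1=y, 2=z."""
    q = np.zeros(4)
    q[0] = np.cos(angle / 2.0)
    q[1 + axis] = np.sin(angle / 2.0)
    return q

def quat_mul(a, b):
    """Hamilton product of two quaternions in [w, x, y, z] order."""
    w1, x1, y1, z1 = a
    w2, x2, y2, z2 = b
    return np.array([
        w1 * w2 - x1 * x2 - y1 * y2 - z1 * z2,
        w1 * x2 + x1 * w2 + y1 * z2 - z1 * y2,
        w1 * y2 - x1 * z2 + y1 * w2 + z1 * x2,
        w1 * z2 + x1 * y2 - y1 * x2 + z1 * w2,
    ])

# Equation (4): compose the camera-odometry rotation from a yaw factor q_z(gamma)
# and a pitch/roll factor q_xy(alpha, beta) = q_x(alpha) q_y(beta).
alpha, beta, gamma = 0.05, -0.02, 1.3        # illustrative Euler angles (rad)
q_xy = quat_mul(quat_about(0, alpha), quat_about(1, beta))
q_rc = quat_mul(quat_about(2, gamma), q_xy)

# Rotations about the same axis commute: a planar robot's rotation (about z)
# and q_z(gamma) can be swapped, which is what simplifies Equation (2).
q_robot = quat_about(2, 0.7)
assert np.allclose(quat_mul(q_robot, quat_about(2, gamma)),
                   quat_mul(quat_about(2, gamma), q_robot))
```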
Since both ${}^{R_{i+1}}_{R_i}q$ and $q_z(\gamma)$ represent rotations around the z axis, they satisfy the
commutative law. After simplifying Equation (2), the rotation residual term becomes