The idea is to maintain a second, delayed marginalization
prior, which incurs very little overhead but enables three core
techniques:
1) We can populate the delayed factor graph with new
IMU factors to perform the proposed pose graph
bundle adjustment (PGBA). This is the basis of an
IMU initialization which captures the full photometric
uncertainty, leading to increased accuracy.
2) The graph used for IMU initialization can be readvanced,
providing a marginalization prior with IMU
information for the main system.
3) When the scale changes significantly in the main
system, we can trigger marginalization replacement.
The combination of these techniques makes for a highly
accurate initializer, which is robust even to long periods of
unobservability. Building on it, we implement a visual-inertial
odometry (VIO) system featuring a photometric front-end
integrated with a new dynamic photometric weight.
We evaluate our method on three challenging datasets
(Fig. 1), covering three domains: the EuRoC dataset [7],
recorded by a flying drone; the TUM-VI dataset [8], captured
with a handheld device; and the 4Seasons dataset [9],
representing the automotive scenario. The latter features long
stretches of constant velocity, posing a particular challenge
for mono-inertial odometry.
We show that our system exceeds the state of the art in
visual-inertial odometry, even outperforming stereo-inertial
methods. In summary, our contributions are:
• Delayed marginalization compensates for the drawbacks
of marginalization while retaining its advantages.
• Pose graph bundle adjustment (PGBA) combines the
efficiency of pose graph optimization with the full
uncertainty of bundle adjustment.
• A state-of-the-art visual-inertial odometry system with
a novel multi-stage IMU initializer and dynamically
weighted photometric factors.
The full source code for our approach will be released.
II. RELATED WORK
Initially, most visual odometry and SLAM systems were
feature-based [10], using either filtering [11] or non-linear
optimization [12], [13]. More recently, direct methods
have been proposed, which optimize a photometric error
function and can operate on dense [14], [15], semi-dense [16],
or sparse point clouds [17].
Mourikis and Roumeliotis [1] have shown that a tight
integration of visual and inertial measurements can greatly
increase the accuracy and robustness of odometry. Subsequently,
many tightly-coupled visual-inertial odometry [18], [19] and
SLAM systems [20], [21], [3], [5] have been proposed.
Initialization of monocular visual-inertial systems is not
trivial, as sufficient motion is necessary for the scale to
become observable [22], [2]. Most systems [4], [3], [5] start
with a visual-only system and use its output for a separate
IMU initialization. In contrast to these systems, we continue
to optimize the scale explicitly in the main system. We note
that ORB-SLAM3 [5] also continues to refine the scale after
initialization, but this is a separate optimization that fixes all
poses and is only performed until 75 seconds after initialization.
The approach in [23] also continues to optimize the scale in the
main system, but in contrast to ours it does not transfer
covariances between the main system and the initializer, and
thus does not achieve the same level of accuracy. In contrast
to all of these systems, the proposed delayed marginalization
allows our IMU initializer to capture the full visual uncertainty
and to continuously optimize the scale in the main system.
VI-DSO [6] initializes immediately with an arbitrary scale
and explicitly optimizes the scale in the main system. It
also introduced dynamic marginalization to handle the resulting
large scale changes in the main system. Compared to it, we
propose a separate IMU initializer, delayed marginalization
as a better alternative to dynamic marginalization, a dynamic
photometric error weight, and further improvements, resulting
in greatly increased accuracy and robustness.
III. METHOD
A. Notation
We denote vectors as bold lowercase letters $\mathbf{x}$, matrices as
bold upper-case letters $\mathbf{H}$, scalars as lowercase letters $\lambda$, and
functions as uppercase letters $E$. $\mathbf{T}^{V}_{w\,\mathrm{cam}_i} \in \mathrm{SE}(3)$ represents
the transformation from camera $i$ to world in the visual
coordinate frame $V$, and $\mathbf{R}^{V}_{w\,\mathrm{cam}_i} \in \mathrm{SO}(3)$ is the respective
rotation. Poses are represented either in the visual frame
$\mathbf{P}^{V}_i := \mathbf{T}^{V}_{\mathrm{cam}_i\,w}$, or in the inertial frame $\mathbf{P}^{I}_i := \mathbf{T}^{I}_{w\,\mathrm{imu}_i}$. If not mentioned
otherwise we use poses in the visual frame, $\mathbf{P}_i := \mathbf{P}^{V}_i$. We also
use states $\mathbf{s}$, which can contain transformations, rotations,
and vectors. For states we define the subtraction operator
$\mathbf{s}_i \boxminus \mathbf{s}_j$, which applies $\log(\mathbf{R}_i \mathbf{R}_j^{-1})$ for rotations and other Lie
group elements, and a regular subtraction for vector values.
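To make the operator concrete, the following Python sketch (our own
illustrative helper, not part of the released code; the function name
`state_subtract` and the dictionary-based state layout are assumptions)
applies the rotation logarithm to Lie group components and plain
subtraction to vector components.

```python
import numpy as np
from scipy.spatial.transform import Rotation


def state_subtract(state_i, state_j):
    """Illustrative sketch of the state subtraction operator.

    Each state is a dict mapping component names to either a scipy Rotation
    (Lie group element) or a numpy vector. Rotations are compared via
    log(R_i * R_j^{-1}); vector-valued components are subtracted directly.
    """
    delta = {}
    for name, value_i in state_i.items():
        value_j = state_j[name]
        if isinstance(value_i, Rotation):
            # Relative rotation mapped to the tangent space via the SO(3) log.
            delta[name] = (value_i * value_j.inv()).as_rotvec()
        else:
            # Regular subtraction for vector values (e.g. velocity, biases).
            delta[name] = np.asarray(value_i) - np.asarray(value_j)
    return delta


# Minimal usage example with one rotation and one vector component.
s_i = {"R": Rotation.from_euler("z", 10, degrees=True), "v": np.array([1.0, 0.0, 0.0])}
s_j = {"R": Rotation.from_euler("z", 4, degrees=True), "v": np.array([0.5, 0.0, 0.0])}
print(state_subtract(s_i, s_j))
```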
B. Direct Visual-Inertial Bundle Adjustment
The core of DM-VIO is the visual-inertial bundle adjust-
ment performed for all keyframes. As commonly done, we
jointly optimize visual and IMU variables in a combined
energy function. For the visual part we choose a direct formu-
lation based on DSO [17], as it is a very accurate and robust
system. For integrating IMU data into the bundle adjustment
we perform preintegration [24] between keyframes.
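As a rough illustration of the idea behind preintegration (a heavily
simplified sketch: bias correction, noise propagation, and the covariance
bookkeeping of [24] are omitted, and all function and variable names are
our own), consecutive IMU samples between two keyframes can be accumulated
into relative rotation, velocity, and position increments:

```python
import numpy as np
from scipy.spatial.transform import Rotation


def preintegrate(imu_samples, dt):
    """Greatly simplified IMU preintegration between two keyframes.

    imu_samples: list of (gyro, accel) pairs, each a 3-vector in the body frame.
    dt: sampling interval in seconds.
    Returns the preintegrated rotation, velocity, and position increments
    relative to the first keyframe's body frame (gravity is accounted for
    later, when the increments are composed with world-frame states).
    """
    dR = Rotation.identity()
    dv = np.zeros(3)
    dp = np.zeros(3)
    for gyro, accel in imu_samples:
        acc = dR.apply(accel)                 # acceleration in the start frame
        dp += dv * dt + 0.5 * acc * dt ** 2   # position increment
        dv += acc * dt                        # velocity increment
        dR = dR * Rotation.from_rotvec(np.asarray(gyro) * dt)  # rotation increment
    return dR, dv, dp
```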
We optimize the following energy function using the
Levenberg-Marquardt algorithm:
$E(\mathbf{s}) = W(e_{\mathrm{photo}}) \cdot E_{\mathrm{photo}} + E_{\mathrm{imu}} + E_{\mathrm{prior}}$ (1)
$E_{\mathrm{prior}}$ contains added priors on the first pose and the gravity
direction, as well as the marginalization priors explained in
Section III-C. In the following we describe the individual
energy terms and the optimized state.
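As a minimal sketch of how Eq. (1) composes the total energy (the
threshold-based weight below is a hypothetical stand-in for the dynamic
photometric weight $W$, which is described later; the argument
`e_photo_rms`, a scalar summary of the photometric residual, and all other
names are our own assumptions):

```python
def total_energy(e_photo_rms, E_photo, E_imu, E_prior, threshold=8.0):
    """Sketch of Eq. (1): weighted photometric term plus IMU and prior terms.

    The weight is a hypothetical placeholder: it equals 1 for small
    photometric residuals and decays when the residual grows, so that
    poorly tracked frames do not dominate the inertial and prior terms.
    """
    w = 1.0 if e_photo_rms < threshold else (threshold / e_photo_rms) ** 2
    return w * E_photo + E_imu + E_prior
```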
Photometric error: The photometric energy is based on
[17]. We optimize a set of active keyframes $F$, each of which
hosts a set of points $P_i$. Every point $\mathbf{p}$ is projected into all
keyframes $\mathrm{obs}(\mathbf{p})$ where it is visible, and the photometric
energy is computed:

$E_{\mathrm{photo}} = \sum_{i \in F} \sum_{\mathbf{p} \in P_i} \sum_{j \in \mathrm{obs}(\mathbf{p})} E_{\mathbf{p}j}$ (2)
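The structure of Eq. (2) can be read as a triple loop; in the following
sketch, `point_energy` is a hypothetical callback standing in for the
per-observation residual $E_{\mathbf{p}j}$ of [17], and the attribute names
are our own assumptions:

```python
def photometric_energy(active_keyframes, point_energy):
    """Sketch of Eq. (2): sum over host keyframes, hosted points,
    and observing keyframes.

    active_keyframes: iterable of keyframes, each with an attribute
        `points` (its hosted points); every point carries `observations`,
        the keyframes in which it is visible.
    point_energy: callable (host_kf, point, obs_kf) -> float, a stand-in
        for the per-observation photometric residual E_pj.
    """
    energy = 0.0
    for host_kf in active_keyframes:           # i in F
        for point in host_kf.points:           # p in P_i
            for obs_kf in point.observations:  # j in obs(p)
                energy += point_energy(host_kf, point, obs_kf)
    return energy
```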