
A General Optimization-based Framework for Local Odometry
Estimation with Multiple Sensors
Tong Qin, Jie Pan, Shaozu Cao, and Shaojie Shen
Abstract— Nowadays, more and more sensors are mounted
on robots to increase robustness and autonomy. We
have seen various sensor suites on different platforms,
such as stereo cameras on ground vehicles, a monocular camera
with an IMU (Inertial Measurement Unit) on mobile phones,
and stereo cameras with an IMU on aerial robots. Although
many algorithms for state estimation have been proposed in the
past, they are usually applied to a single sensor or a specific
sensor suite. Few of them can be employed with multiple sensor
choices. In this paper, we propose a general optimization-based
framework for odometry estimation, which supports multiple
sensor sets. Every sensor is treated as a general factor in our
framework. Factors which share common state variables are
summed together to build the optimization problem. We further
demonstrate the generality with visual and inertial sensors,
which form three sensor suites (stereo cameras, a monocular
camera with an IMU, and stereo cameras with an IMU). We
validate the performance of our system on public datasets and
through real-world experiments with multiple sensors. Results
are compared against other state-of-the-art algorithms. We
highlight that our system is a general framework, which can
easily fuse various sensors in a pose graph optimization. Our
implementations are open source¹.
I. INTRODUCTION
Real-time 6-DoF (Degrees of Freedom) state estimation
is a fundamental technology for robotics. Accurate state
estimation plays an important role in various intelligent
applications, such as robot exploration, autonomous driving,
VR (Virtual Reality) and AR (Augmented Reality). The most
common sensors we use in these applications are cameras. A
large number of impressive vision-based algorithms for pose
estimation have been proposed over the last decades, such as
[1]–[5]. Besides cameras, the IMU is another popular option
for state estimation. The IMU can measure acceleration and
angular velocity at a high frequency, which is necessary for
low-latency pose feedback in real-time applications. Hence,
there are numerous research works fusing vision and IMU
together, such as [6]–[12]. Another popular sensor used in
state estimation is LiDAR. LiDAR-based approaches [13]
achieve accurate pose estimation in a confined local envi-
ronment. Although a lot of algorithms have been proposed
in the past, they are usually applied to a single input sensor
or a specific sensor suite.
Recently, we have seen platforms equipped with various
sensor sets, such as stereo cameras on ground vehicles, a
monocular camera with an IMU on mobile phones, and stereo
cameras with an IMU on aerial robots.

All authors are with the Department of Electronic and
Computer Engineering, Hong Kong University of Science and
Technology, Hong Kong, China. {tong.qin, jie.pan,
shaozu.cao}@connect.ust.hk, eeshaojie@ust.hk.
¹ https://github.com/HKUST-Aerial-Robotics/VINS-Fusion

Fig. 1. An illustration of the proposed framework for state estimation,
which supports multiple sensor choices, such as stereo cameras, a monocular
camera with an IMU, and stereo cameras with an IMU. Each sensor is
treated as a general factor. Factors which share common state variables are
summed together to build the optimization problem.

However, as most
traditional algorithms were designed for a single sensor or
a specific sensor set, they cannot be ported to different
platforms. Even for one platform, we need to choose dif-
ferent sensor combinations in different scenarios. Therefore,
a general algorithm which supports different sensor suites
is required. Another practical requirement is that, in case
of sensor failure, the failed sensor should be removed and an
alternative sensor added to the system quickly. Hence, a
general algorithm that is compatible with
multiple sensors is needed.
In this paper, we propose a general optimization-based
framework for pose estimation, which supports multiple
sensor combinations. We further demonstrate it with visual
and inertial sensors, which form three sensor suites (stereo
cameras, a monocular camera with an IMU, and stereo cam-
eras with an IMU). We can easily switch between different
sensor combinations. We highlight the contribution of this
paper as follows:
• a general optimization-based framework for state esti-
mation, which supports multiple sensors.
• a detailed demonstration of state estimation with visual
and inertial sensors, which form different sensor suites
(stereo cameras, a monocular camera + an IMU, and
stereo cameras + an IMU).
• an evaluation of the proposed system on both public
datasets and real experiments.
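The framework's core idea is that every sensor contributes a factor, and factors sharing common state variables are summed into a single optimization problem. A minimal sketch of this summation, using a toy 1-D linear least-squares problem with purely illustrative states, factors, and values (not the paper's implementation):

```python
# Toy illustration: each sensor is a "factor" that constrains a subset of
# the states; factors sharing states are summed into one least-squares
# cost, which for this linear toy problem we minimize in closed form.

def solve_2x2(A, b):
    """Solve a 2x2 linear system A x = b by Cramer's rule."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [(b[0] * A[1][1] - A[0][1] * b[1]) / det,
            (A[0][0] * b[1] - b[0] * A[1][0]) / det]

# States: two poses x0, x1 (1-D positions for simplicity).
# Each factor is (Jacobian row over [x0, x1], measurement, weight).
factors = [
    ([1.0, 0.0], 0.0, 1.0),   # prior factor on x0 (initial pose)
    ([-1.0, 1.0], 1.0, 1.0),  # relative factor x1 - x0 from sensor A
    ([-1.0, 1.0], 1.2, 1.0),  # relative factor x1 - x0 from sensor B
]

# Build the normal equations H x = g by summing every factor's
# contribution over the states it touches.
H = [[0.0, 0.0], [0.0, 0.0]]
g = [0.0, 0.0]
for J, z, w in factors:
    for i in range(2):
        g[i] += w * J[i] * z
        for j in range(2):
            H[i][j] += w * J[i] * J[j]

x = solve_2x2(H, g)
print(x)  # x1 lands between the two relative measurements (1.0 and 1.2)
```

Because every factor only adds its own term to `H` and `g`, a sensor can be attached or dropped without touching the rest of the problem, which is the property that makes switching between sensor suites straightforward.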
arXiv:1901.03638v1 [cs.CV] 11 Jan 2019