Robust Place Recognition using an Imaging Lidar
Tixiao Shan, Brendan Englot, Fábio Duarte, Carlo Ratti, and Daniela Rus
Abstract— We propose a methodology for robust, real-time
place recognition using an imaging lidar, which yields image-
quality high-resolution 3D point clouds. Utilizing the intensity
readings of an imaging lidar, we project the point cloud
and obtain an intensity image. ORB feature descriptors are
extracted from the image and encoded into a bag-of-words
vector. The vector, used to identify the point cloud, is inserted
into a database that is maintained by DBoW for fast place
recognition queries. The returned candidate is further validated
by matching visual feature descriptors. To reject matching outliers, we apply Perspective-n-Point (PnP) with RANSAC, which minimizes the reprojection error between the visual features' positions in Euclidean space and their correspondences in 2D image space. Combining the advantages of both camera- and lidar-based place recognition approaches, our method is truly rotation-invariant and can handle reverse and upside-down revisiting. The proposed method is evaluated on datasets
gathered from a variety of platforms over different scales
and environments. Our implementation is available at https:
//git.io/imaging-lidar-place-recognition.
I. INTRODUCTION
Place recognition plays an important role in many mo-
bile robotics applications, such as solving the kidnapped
robot problem, localizing a robot in a known map, and
maintaining the accuracy of simultaneous localization and
mapping (SLAM). During the last two decades, a variety
of place recognition methods have achieved great success
in tackling such problems using camera, lidar, and other
perceptual sensors. Camera-based place recognition methods
often extract visual features from textured scenes and find
candidates using a bag-of-words approach. However, such methods are sensitive to illumination and viewpoint changes.
On the other hand, lidar-based place recognition methods,
which often extract local or global descriptors from a point
cloud, are invariant to such changes. The long detection
range and wide aperture of lidar permit the capture of many
structural details of an environment. Yet such details are often
discarded during descriptor extraction, which may result
in false positive detections when surrounded by repeating
structures. Because most lidars offer only low resolution, camera-based methods typically cannot be applied to lidar data; conversely, lidar-based methods typically cannot be applied to camera data, which lacks structural information. However, with the recent availability of high-resolution lidars such as the Ouster OS1-128 and the Velodyne VLS-128, we can begin to bridge the gap between camera-based and lidar-based place recognition methods.
T. Shan, F. Duarte, and C. Ratti are with the Department of Urban Studies and Planning, Massachusetts Institute of Technology, USA, {shant, fduarte, ratti}@mit.edu.
B. Englot is with the Department of Mechanical Engineering, Stevens Institute of Technology, USA, benglot@stevens.edu.
T. Shan and D. Rus are with the Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology, USA, {shant, rus}@mit.edu.
Fig. 1: A demonstration of the proposed method applied to a
mapping task. Left: a loop is found when the place is revisited.
Grayscale images are intensity images projected from point clouds.
Green lines connect the matched features. Right: top-view point
cloud map of a parking lot. Red line indicates the traversed
trajectory. Blue segments along with green dots indicate detected
loop closures using our method. Note that features are extracted
from the traffic arrow on the ground for place recognition.
We refer to such a high-resolution lidar, which produces image-quality 3D scans, as an imaging lidar. Driven by the prospects of this technology,
we present a method for robust place recognition using
an imaging lidar. We first project the high-resolution point
cloud with intensity information onto an intensity image.
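As an illustration of this projection step, the sketch below maps an intensity-annotated point cloud onto a 2D image using a simple spherical model, and also records the 3D point behind each pixel. The helper name, the 1024×128 resolution, and the vertical field of view are assumptions made for the example, not the exact parameters of the sensor or of our implementation.

```python
import numpy as np

def project_to_intensity_image(points, intensities,
                               height=128, width=1024,
                               fov_up_deg=22.5, fov_down_deg=-22.5):
    """Spherically project an intensity-annotated point cloud onto a 2D image.

    points      : (N, 3) x, y, z coordinates in the sensor frame
    intensities : (N,) lidar intensity (reflectivity) readings
    Returns an (H, W) uint8 intensity image and an (H, W, 3) map holding the
    3D point behind each pixel. Resolution and vertical field of view are
    illustrative values, not the actual sensor parameters.
    """
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    depth = np.linalg.norm(points, axis=1) + 1e-6
    yaw = np.arctan2(y, x)            # azimuth angle
    pitch = np.arcsin(z / depth)      # elevation angle
    fov_up, fov_down = np.deg2rad(fov_up_deg), np.deg2rad(fov_down_deg)

    # Map azimuth to image columns and elevation to image rows.
    u = np.clip(np.floor(0.5 * (1.0 - yaw / np.pi) * width),
                0, width - 1).astype(np.int32)
    v = np.clip(np.floor((fov_up - pitch) / (fov_up - fov_down) * height),
                0, height - 1).astype(np.int32)

    image = np.zeros((height, width), dtype=np.uint8)
    xyz_map = np.zeros((height, width, 3), dtype=np.float32)
    # Scale intensity to [0, 255]; real sensors may need percentile or gamma scaling.
    image[v, u] = np.clip(intensities / (intensities.max() + 1e-6) * 255.0,
                          0, 255).astype(np.uint8)
    xyz_map[v, u] = points
    return image, xyz_map
```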
We then extract Oriented FAST and Rotated BRIEF (ORB) feature descriptors from the intensity image. The extracted descriptors are converted into a bag-of-words (BoW) vector, which serves as a compact representation of the original point cloud. A DBoW database is built from these vectors and queried for place recognition.
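Our implementation relies on DBoW, a C++ library; as a simplified stand-in, the sketch below extracts ORB descriptors with OpenCV, quantizes them against an assumed pre-trained vocabulary of binary words, and scores places by the cosine similarity of their BoW histograms. The vocabulary, helper names, and acceptance threshold are illustrative assumptions only.

```python
import cv2
import numpy as np

orb = cv2.ORB_create(nfeatures=500)

def bow_vector(image, vocabulary):
    """Extract ORB features from an intensity image and quantize them into a
    normalized bag-of-words histogram. `vocabulary` is an assumed pre-trained
    (K, 32) uint8 array holding one binary ORB word per row."""
    keypoints, descriptors = orb.detectAndCompute(image, None)
    if descriptors is None:
        return keypoints, None, np.zeros(len(vocabulary))
    # Hamming distance from every descriptor to every word (fine for small K).
    dists = np.count_nonzero(
        np.unpackbits(descriptors[:, None, :] ^ vocabulary[None, :, :], axis=2),
        axis=2)
    words = dists.argmin(axis=1)
    hist = np.bincount(words, minlength=len(vocabulary)).astype(np.float64)
    hist /= np.linalg.norm(hist) + 1e-12
    return keypoints, descriptors, hist

def query_database(hist, db_hists, min_score=0.3):
    """Return the index of the most similar stored place, or None if the best
    cosine similarity falls below an assumed acceptance threshold."""
    if not db_hists:
        return None
    scores = [float(np.dot(hist, h)) for h in db_hists]
    best = int(np.argmax(scores))
    return best if scores[best] > min_score else None
```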
If a candidate is found, we match the ORB descriptors to ensure that enough features can be matched between the two places. To reject matching outliers, we formulate the verification as an optimization problem by applying Perspective-n-Point (PnP) with Random Sample Consensus (RANSAC).
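This geometric check might be sketched as follows: the 3D position of each matched feature in the candidate scan is recovered from the lidar point behind its pixel, and cv2.solvePnPRansac discards matches whose reprojection error into the query image is too large. The intrinsic matrix K, standing in for whatever model relates 3D points to intensity-image pixels, and the inlier threshold are assumptions for the example.

```python
import cv2
import numpy as np

def verify_candidate(des_query, des_cand, kp_query, xyz_cand, K, min_inliers=25):
    """Match ORB descriptors between the query and the candidate place, then
    reject outliers with PnP RANSAC.

    des_query, des_cand : binary ORB descriptors of the two intensity images
    kp_query            : keypoints of the query image (2D pixel positions)
    xyz_cand            : (M, 3) lidar points behind the candidate's keypoints,
                          indexed like des_cand
    K                   : assumed 3x3 intrinsic matrix of the projection model
    """
    if des_query is None or des_cand is None:
        return False, None
    matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
    matches = matcher.match(des_cand, des_query)
    if len(matches) < min_inliers:
        return False, None

    object_pts = np.float32([xyz_cand[m.queryIdx] for m in matches])
    image_pts = np.float32([kp_query[m.trainIdx].pt for m in matches])

    # Minimize the reprojection error of the candidate's 3D feature positions
    # against their 2D correspondences in the query image.
    ok, rvec, tvec, inliers = cv2.solvePnPRansac(
        object_pts, image_pts, K, distCoeffs=None,
        iterationsCount=100, reprojectionError=5.0)
    if not ok or inliers is None or len(inliers) < min_inliers:
        return False, None
    return True, (rvec, tvec)
```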
A representative example of our method is shown in Figure 1. The main contributions of our work, which combines techniques from both camera-based and lidar-based place recognition methods, are as follows:
• Real-time, robust place recognition designed for imaging lidar and, to our knowledge, the first to use projected lidar intensity images for place recognition.
• The proposed method is invariant to sensor attitude changes and can detect reverse revisiting and even upside-down revisiting.
• Our method is extensively validated with data gathered
across different scales, platforms, and environments.
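For concreteness, the hypothetical helpers sketched above could be chained into a single per-scan loop-closure check as follows; the place database is kept as plain Python lists purely for illustration.

```python
import numpy as np

def process_scan(points, intensities, vocabulary, K, db_hists, db_entries):
    """Check one incoming scan for a loop closure, then add it to the database.

    db_hists   : list of BoW histograms of previously visited places
    db_entries : list of (descriptors, keypoint_xyz) tuples for those places
    Returns (matched_place_index, (rvec, tvec)) on success, otherwise None.
    """
    image, xyz_map = project_to_intensity_image(points, intensities)
    keypoints, descriptors, hist = bow_vector(image, vocabulary)

    loop = None
    match_idx = query_database(hist, db_hists)
    if match_idx is not None:
        cand_des, cand_xyz = db_entries[match_idx]
        ok, pose = verify_candidate(descriptors, cand_des, keypoints, cand_xyz, K)
        if ok:
            loop = (match_idx, pose)

    # Store this place so future scans can match against it.
    kp_xyz = np.float32([xyz_map[int(kp.pt[1]), int(kp.pt[0])] for kp in keypoints])
    db_hists.append(hist)
    db_entries.append((descriptors, kp_xyz))
    return loop
```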
II. RELATED WORK
Our work draws upon concepts used in both camera-based
and lidar-based place recognition methods. Due to their low hardware cost and robustness in texture-rich