HDNET: Exploiting HD Maps for 3D Object Detection

Bin Yang¹,², Ming Liang¹, Raquel Urtasun¹,²
¹Uber Advanced Technologies Group, ²University of Toronto
{byang10, ming.liang, urtasun}@uber.com
Abstract: In this paper we show that High-Definition (HD) maps provide strong
priors that can boost the performance and robustness of modern 3D object detec-
tors. Towards this goal, we design a single-stage detector that extracts geometric
and semantic features from the HD maps. As maps might not be available every-
where, we also propose a map prediction module that estimates the map on the fly
from raw LiDAR data. We conduct extensive experiments on KITTI [1] as well as
a large-scale 3D detection benchmark containing 1 million frames, and show that
the proposed map-aware detector consistently outperforms the state-of-the-art in
both mapped and un-mapped scenarios. Importantly, the whole framework runs at
20 frames per second.
Keywords: 3D Object Detection, HD Maps, Autonomous Driving
1 Introduction
Autonomous vehicles have the potential of providing cheaper and safer transportation. A typical
autonomous system is composed of the following functional modules: perception, prediction, plan-
ning and control [2]. Perception is concerned with detecting the objects of interest (e.g., vehicles)
in the scene and tracking them over time. The prediction module estimates the intentions and trajectories
of all actors into the future. Motion planning is responsible for producing a trajectory that is safe,
while control outputs the commands necessary for the self-driving vehicle to execute such trajectory.
3D object detection is a fundamental task in perception systems. Modern 3D object detectors [3, 4]
exploit LiDAR as input as it provides good geometric cues and eases 3D localization when compared
to camera-only approaches. In the context of real-time applications, single-shot detectors [5, 6, 7]
have been shown to be more promising than proposal-based methods [8, 4], as they are efficient
and produce accurate estimates. However, object detection is far from solved, as many challenges
remain, such as dealing with occlusion and the sparsity of LiDAR returns at long range.
Most self-driving systems have access to High-Definition (HD) maps that contain geometric and
semantic information about the environment. While HD maps are widely used by motion planning
systems [9, 10], they are largely ignored by perception systems [11]. In this paper we argue that
HD maps provide strong priors that can boost the performance and robustness of modern object
detectors. Towards this goal, we derive an efficient and effective single-stage detector that operates
in Bird’s Eye View (BEV) and fuses LiDAR information with rasterized maps. Bird’s eye view is
a good representation for 3D LiDAR as it is amenable to efficient inference and retains the metric
space. Since HD maps might not be available everywhere, we also propose a map prediction module
that estimates the map geometry and semantics from a single online LiDAR sweep.
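To make the fusion concrete, below is a minimal PyTorch sketch (not the authors' implementation) of early-fusing rasterized HD map channels with a LiDAR BEV grid before a single-stage backbone. The channel counts, grid resolution, backbone depth, and output parameterization are illustrative assumptions; the map channels stand in for rasterized geometric and semantic priors (e.g., ground height and a road mask).

```python
# Minimal sketch (assumptions, not the paper's exact architecture) of
# early fusion between a LiDAR BEV grid and rasterized HD map priors.
import torch
import torch.nn as nn

class MapAwareBEVDetector(nn.Module):
    def __init__(self, lidar_channels: int, map_channels: int):
        super().__init__()
        in_channels = lidar_channels + map_channels
        # Placeholder backbone: two conv layers standing in for the full
        # single-stage detection network.
        self.backbone = nn.Sequential(
            nn.Conv2d(in_channels, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )
        # Dense per-cell outputs: 1 confidence score + 6 box parameters
        # (x, y, w, l, sin/cos of heading), a common BEV parameterization.
        self.header = nn.Conv2d(128, 7, kernel_size=1)

    def forward(self, lidar_bev: torch.Tensor, map_bev: torch.Tensor):
        # lidar_bev: (B, lidar_channels, H, W) occupancy/height slices
        # map_bev:   (B, map_channels, H, W) rasterized map priors
        x = torch.cat([lidar_bev, map_bev], dim=1)  # early fusion
        return self.header(self.backbone(x))

# Example: a 70.4 m x 80 m region at 0.2 m/cell gives a 352 x 400 grid.
det = MapAwareBEVDetector(lidar_channels=32, map_channels=2)
out = det(torch.zeros(1, 32, 352, 400), torch.zeros(1, 2, 352, 400))
print(out.shape)  # torch.Size([1, 7, 352, 400])
```

When the HD map is unavailable, the same interface holds: the map prediction module would produce `map_bev` from the online LiDAR sweep (e.g., a small segmentation network regressing ground height and a road mask), so the detector itself is unchanged between mapped and un-mapped scenarios.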
Our experiments on the public KITTI BEV object detection benchmark [1] and a large-scale 3D
object detection benchmark TOR4D [3, 12] show that we can achieve significant Average Precision
(AP) gains on top of a state-of-the-art detector by exploiting HD maps. On TOR4D, where HD maps
are available, we achieve 2.42%, 3.43% and 5.49% AP gains over the 0-70 m, 30-50 m
and 50-70 m ranges, respectively. On KITTI, where HD maps are unavailable, we show that when using
a pre-trained map prediction module (trained on a different continent) we can still get 2.87% AP
gain, surpassing all competing methods including those which also exploit cameras. Importantly,
the proposed map-aware detector runs at 20 frames per second.
2nd Conference on Robot Learning (CoRL 2018), Zürich, Switzerland.