End-to-end Interpretable Neural Motion Planner
Wenyuan Zeng¹,²∗  Wenjie Luo¹,²∗  Simon Suo¹,²  Abbas Sadat¹  Bin Yang¹,²  Sergio Casas¹,²  Raquel Urtasun¹,²
¹Uber Advanced Technologies Group   ²University of Toronto
{wenyuan,wenjie,suo,abbas,byang10,sergio.casas,urtasun}@uber.com
∗ denotes equal contribution.
Abstract
In this paper, we propose a neural motion planner for
learning to drive autonomously in complex urban scenar-
ios that include traffic-light handling, yielding, and interac-
tions with multiple road-users. Towards this goal, we design
a holistic model that takes as input raw LiDAR data and an
HD map and produces interpretable intermediate represen-
tations in the form of 3D detections and their future trajec-
tories, as well as a cost volume defining the goodness of
each position that the self-driving car can take within the
planning horizon. We then sample a set of diverse physi-
cally possible trajectories and choose the one with the min-
imum learned cost. Importantly, our cost volume is able to
naturally capture multi-modality. We demonstrate the ef-
fectiveness of our approach in real-world driving data cap-
tured in several cities in North America. Our experiments
show that the learned cost volume yields safer plans
than all the baselines.
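The planning scheme described above (sample physically feasible trajectories, score each against the learned cost volume, keep the minimum-cost one) can be sketched as follows. This is a toy illustration: the array shapes, the random sampler, and all names are assumptions, not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy learned cost volume: one H x W cost map per future timestep.
T, H, W = 5, 50, 50
cost_volume = rng.random((T, H, W))

def sample_trajectories(n):
    """Sample n toy trajectories as (row, col) waypoints, one per timestep."""
    start = np.array([H // 2, W // 2])
    steps = rng.integers(-1, 2, size=(n, T, 2))  # small random motions
    return np.clip(start + np.cumsum(steps, axis=1), 0, H - 1)

def trajectory_cost(traj):
    """Sum the cost volume along a trajectory's waypoints."""
    return sum(cost_volume[t, r, c] for t, (r, c) in enumerate(traj))

trajectories = sample_trajectories(100)
best = min(trajectories, key=trajectory_cost)  # minimum learned cost
```

In the paper the sampler is restricted to physically possible trajectories and the cost volume is produced by the network; here both are random placeholders.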
1. Introduction
Self-driving vehicles (SDVs) are going to revolutionize
the way we live. Building reliable SDVs at scale is, how-
ever, not a solved problem. As is the case in many appli-
cation domains, the field of autonomous driving has been
transformed in the past few years by the success of deep
learning. Existing approaches that leverage this technology
can be characterized into two main frameworks: end-to-end
driving and traditional engineering stacks.
End-to-end driving approaches [3, 24] take the output of
the sensors (e.g., LiDAR, images) and use it as input to a
neural net that outputs control signals, e.g., steering com-
mand and acceleration. The main benefit of this framework
is its simplicity: a model can be built with only a few lines
of code, and labeled training data can be obtained automati-
cally by recording human driving on an SDV platform. In
practice, this approach suffers from compounding errors,
since self-driving control is a sequential decision problem,
and it requires massive amounts of data to
generalize. Furthermore, the network offers little interpretability,
making it difficult to analyze its mistakes. It is also
hard to incorporate sophisticated prior knowledge about the
scene, e.g. that vehicles should not collide.
In contrast, most self-driving car companies utilize a
traditional engineering stack, where the problem is divided
into subtasks: perception, prediction, motion planning and
control. Perception is in charge of estimating all actors’ po-
sitions and motions, given the current and past evidences.
This involves solving tasks such as 3D object detection and
tracking. Prediction¹, on the other hand, tackles the prob-
lem of estimating the future positions of all actors as well
as their intentions (e.g., changing lanes, parking). Finally,
motion planning takes the output from previous stacks and
generates a safe trajectory for the SDV to execute via a con-
trol system. This framework has interpretable intermediate
representations by construction, and prior knowledge can be
easily exploited, for example in the form of high definition
maps (HD maps).
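The modular stack just described can be sketched as a chain of stages, each consuming only the previous stage's interpretable output. All class and function names below are hypothetical placeholders, not the paper's interfaces, and each stage body is a trivial stand-in.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]

@dataclass
class Detection:
    position: Point          # current (x, y) of an actor

@dataclass
class Forecast:
    waypoints: List[Point]   # predicted future (x, y) positions

def perceive(sensor_frame: List[Point]) -> List[Detection]:
    # Stand-in for 3D detection and tracking from LiDAR + HD-map input.
    return [Detection(position=p) for p in sensor_frame]

def predict(detections: List[Detection]) -> List[Forecast]:
    # Trivial constant-position forecast as a placeholder prediction model.
    return [Forecast(waypoints=[d.position] * 3) for d in detections]

def plan(forecasts: List[Forecast]) -> List[Point]:
    # Placeholder planner: advance along x, skipping predicted-occupied cells.
    occupied = {p for f in forecasts for p in f.waypoints}
    return [(float(t), 0.0) for t in range(3) if (float(t), 0.0) not in occupied]

frame: List[Point] = [(5.0, 1.0), (8.0, -2.0)]
trajectory = plan(predict(perceive(frame)))
```

The hand-off structure, where each module optimizes its own objective and passes a fixed interface downstream, is exactly what makes the stack interpretable but hard to optimize jointly.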
However, solving each of these sub-tasks is not only
hard, but may also lead to sub-optimal overall system
performance. Most self-driving companies have large en-
gineering teams working on each sub-problem in isolation,
and they train each sub-system with a task specific objec-
tive. As a consequence, an advance in one sub-system does
not easily translate to an overall system performance im-
provement. For instance, 3D detection tries to maximize
AP, where each actor has the same weight. However, in
a driving scenario, high-precision detections of near-range
actors that may influence the SDV’s motion, e.g. through in-
teractions (cutting in, sudden stopping), are more critical. In
addition, uncertainty estimations are difficult to propagate
and computation is not shared among different sub-systems.
This leads to longer reaction times for the SDV and makes
the overall system less reliable.
In this paper we bridge the gap between these two frame-
works. Towards this goal, we propose the first end-to-
¹We’ll use prediction and motion forecasting interchangeably.