Remote Sens. 2021, 13, 1828 4 of 19
designed to estimate the object’s motion. The output of the camera model was in turn
utilized to calculate the measurement matrix of the EKF. The matrix was designed to map between
the position measurements of the objects in the image domain and the corresponding
state vector in the real world. Hayakawa, et al. [13] predicted 2D flow by PWC-Net and
detected the surrounding vehicles’ 3D bounding box using a multi-scale network. The
ego-motion was extracted from the 2D flow using the projection matrix and the ground plane
corrected by depth information. A similar approach was used for the estimation of the
relative velocity of surrounding vehicles. The absolute velocity was derived from the
combination of the ego-motion and the relative velocity. The position and orientation of
surrounding vehicles were calculated by projecting the 3D bounding box into the ground
plane. Min and Huang [14] proposed a method of detecting moving objects from the
difference between the mixed flow (caused by both camera motion and object motion) and
the ego-motion flow (evoked by the moving camera). They established the mathematical
relationship between optical flow, depth, and camera ego-motion. Accordingly, a visual
odometer was implemented for the estimation of ego-motion parameters by using ground
points as feature points. The ego-motion flow was calculated from the estimated ego-
motion parameters. The mixed flow was obtained from the correspondence matching
between consecutive images. Zhang, et al. [21] presented a framework to simultaneously
track the camera and multiple objects. The 6-DoF motions of the objects, as well as the
camera, are optimized jointly with the optical flow in a unified formulation. The object
velocity was calculated using the rotation and translation part of the motion of points in the
global reference frame. The proposed framework detected moving objects via combining
Mask R-CNN object segmentation [22] and scene flow, and tracked them over frames using
optical flow.
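The flow-difference idea in [14] can be illustrated with a minimal sketch. The function below is an assumption for illustration, not the authors' implementation: it subtracts the flow predicted from camera ego-motion from the observed (mixed) flow and thresholds the residual magnitude to flag candidate moving-object pixels. The threshold value is an assumed tuning parameter.

```python
import numpy as np

def detect_moving_pixels(mixed_flow, ego_flow, threshold=1.0):
    """Flag pixels whose observed (mixed) flow deviates from the flow
    predicted by camera ego-motion alone.

    mixed_flow, ego_flow: (H, W, 2) arrays of per-pixel (u, v) flow.
    threshold: assumed residual-magnitude cutoff in pixels.
    Returns a boolean (H, W) mask of candidate moving-object pixels.
    """
    residual = np.linalg.norm(mixed_flow - ego_flow, axis=-1)
    return residual > threshold

# Toy example: a 4x4 scene in which one 2x2 patch moves independently.
ego = np.ones((4, 4, 2)) * 0.5           # flow induced by camera motion only
mixed = ego.copy()
mixed[1:3, 1:3] += np.array([3.0, 0.0])  # an object adds its own motion
mask = detect_moving_pixels(mixed, ego)
print(mask.sum())  # 4 moving pixels
```

In practice, the ego-motion flow would be rendered from the estimated ego-motion parameters and per-pixel depth rather than assumed, and the residual would be regularized before thresholding.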
Different from the first two categories of the methods, the learning-based
method [8,15,23,24] does not require a specific mathematical estimation model but relies
on machine learning and the ability of neural network regression to estimate the
motion parameters. Jain, et al. [8] used Farneback’s algorithm to calculate optical flow and
the DeepSort algorithm to track vehicles detected from the YOLO-v3. The optical flow
and the tracking information of the vehicle were then treated as input for two different
networks. The features extracted from the two networks were stacked to create a new
input for a lightweight Multilayer Perceptron architecture which finally predicts positions
and velocities. Cao, et al. [15] presented a network for learning motion parameters from
stereo videos. The network masked object instances and predicted specific 3D scene flow
maps, from which the motion direction and speed for each object can be derived. The
network took the 3D geometry of the problem into account, which allowed it to correlate
the input images. Kim, et al. [23] developed a deep neural network that exploits different
levels of semantic information to perform the motion estimation. The network used a
multi-context pooling layer that integrates both object and global features, and adopted a
cyclic ordinal regression scheme using binary classifiers for effective motion classification.
In the detection stage, they ran the YOLO-v3 detector to obtain the bounding boxes.
Song, et al. [24] presented an end-to-end deep neural network for the estimation of inter-vehicle
distance and relative velocity. The network integrated multiple visual clues provided by
two time-consecutive frames, including a deep feature clue, a scene geometry clue, and a
temporal optical flow clue. It also used a vehicle-centric sampling mechanism to
alleviate the effect of perspective distortion in the motion field.
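The feature-stacking step described for [8] can be sketched as follows. This is a hedged illustration, not the published architecture: the feature sizes, weights, and layer widths are assumptions, chosen only to show how two separately extracted feature vectors are concatenated and fed to a small multilayer perceptron that regresses position and velocity.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed feature sizes: 64-d optical-flow features and 32-d tracking
# features per vehicle, produced by two separate upstream networks.
flow_feat = rng.standard_normal(64)
track_feat = rng.standard_normal(32)

# Stack the two feature vectors into one input for the MLP.
x = np.concatenate([flow_feat, track_feat])  # shape (96,)

# Minimal two-layer MLP with random (untrained) weights, regressing
# four outputs: position (x, y) and velocity (vx, vy).
W1, b1 = rng.standard_normal((32, 96)) * 0.1, np.zeros(32)
W2, b2 = rng.standard_normal((4, 32)) * 0.1, np.zeros(4)
h = np.maximum(W1 @ x + b1, 0.0)  # ReLU hidden layer
pos_vel = W2 @ h + b2             # (x, y, vx, vy)
print(pos_vel.shape)              # (4,)
```

The design choice worth noting is that each modality keeps its own feature extractor, so the lightweight MLP only has to fuse already-compact representations.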
Moving object detection is a prerequisite for motion estimation. Most of the existing
methods use bounding boxes as object proposals, which affects the accuracy of motion
estimation in the latter two stages. In this study, we leverage region-level segmentation to
accurately locate object regions for tracking and parameter estimation. Therefore, we review
here relevant segmentation works in comparison with our segmentation method. PSPNet [25]
is a pyramid scene parsing network based on the fully convolutional network [26], which
exploits the capability of global context information by different-region-based context
aggregation. PSPNet can provide a pixel-level prediction for the scene parsing task. Mask