多目标跟踪：最新文献综述

需积分: 9 75 浏览量更新于2024-07-09 收藏 1.22MB PDF 举报

本文是一篇深入的文献综述，标题为《多目标跟踪：一篇文献回顾》（Multi-object tracking: A literature review），发表在《人工智能》（Artificial Intelligence）期刊上，卷号为293，2021年。文章主要关注于多目标跟踪（MOT）这一领域的研究进展，这是一个由于其学术价值和商业应用潜力而日益受到重视的问题。多目标跟踪是指在视频或传感器数据中实时追踪多个目标的移动轨迹，这在视频分析、自动驾驶、安全监控和机器人等领域具有重要意义。尽管研究人员已经提出了多种方法来应对这一复杂问题，包括基于特征匹配的数据关联（Data Association）、深度学习的检测与追踪融合、以及运动模型预测等技术，但挑战依然存在。这些问题主要包括目标的突然出现和消失（abrupt appearance changes）、遮挡、相似性导致的目标混淆、以及大规模场景中的高效处理等。文献回顾部分详细探讨了不同方法的优缺点，比如传统的基于特征的方法依赖于稳定且可区分的特征，但可能在复杂环境中的性能受限；而深度学习方法，尤其是基于卷积神经网络（CNN）的目标检测模型，虽然在准确度上有所提升，但对大量标注数据的需求较高，并且可能存在模型泛化能力的挑战。此外，文章还涵盖了近年来新兴的研究方向，如联合检测与跟踪（Joint Detection and Tracking）、多模态融合（Multimodal Fusion）以及基于强化学习的自适应策略等，这些都在一定程度上提高了MOT的性能，但同时也带来了新的研究课题，如如何优化算法效率、提高鲁棒性以及实现实时性。这篇文献综述为多目标跟踪领域的研究者提供了全面的视角，梳理了当前的研究热点和技术路径，同时指出了未来可能的发展趋势。对于那些希望深入了解该领域的人来说，这篇文章是一份宝贵的参考资料。

W. Luo, J. Xing, A. Milan et al. Artiﬁcial Intelligence 293 (2021) 103448

Table 3

Comparison

between DBT and DFT. Adapted from [51].

Item DBT DFT

Initialization automatic, imperfect manual, perfect

# of objects varying ﬁxed

Applications speciﬁc type of objects (in most cases) any type of objects

Advantages ability to handle varying number of objects free of object detector

Drawbacks performance depends on object detection manual initialization

Fig. 2. An illustration of online (left) and oﬄine (right) tracking. (For interpretation of the references to color in this ﬁgure, the reader is referred to the

web version of this article.)

Table 4

Comparison

between online and oﬄine tracking.

Item Online tracking Oﬄine tracking

Input Up-to-time observations All observations

Methodology

Gradually extend existing trajectories

with current observations

Link observations into trajectories

Advantages Suitable for online tasks Obtain global optimal solution theoretically

Drawbacks Suffer from shortage of observation Delay in outputting ﬁnal results

object detector is trained in advance, the majority of DBT focuses on speciﬁc kinds of targets, such as pedestrians, vehicles

or faces. Second, the performance of DBT highly depends on the performance of the employed object detector.

Detection-free tracking. As shown in Fig. 1 (bottom), DFT [54–57]requires manual initialization of a ﬁxed number of

objects in the ﬁrst frame, then localizes these objects in subsequent frames.

DBT is more popular because new objects are discovered and disappearing objects are terminated automatically. DFT

cannot deal with the case that objects appear. However, it is free of pre-trained object detectors. Table 3 lists the major

differences between DBT and DFT.

2.2.2. Processing mode

MOT can also be categorized into online tracking and oﬄine tracking. The difference is whether observations from future

frames are utilized when handling the current frame. Online, also called causal, tracking methods only rely on the past

information available up to the current frame, while oﬄine, or batch tracking approaches employ observations both in the

past and in the future.

Online tracking. In online tracking [54,58,55,56,59,60,165,160], the image sequence is handled in a step-wise manner,

thus online tracking is also named as sequential tracking. An illustration is shown in Fig. 2 (top), with three objects (different

circles) a, b, and c. The green arrows represent observations in the past. The results are represented by the object’s location

and its ID. Based on the up-to-time observations, trajectories are produced on the ﬂy.

Oﬄine tracking. Oﬄine tracking [53,61,49,62,48,1,63–66]utilizes a batch of frames to process the data. As shown in Fig. 2

(bottom),

observations from all the frames are required to be obtained in advance and are analyzed jointly to estimate the

ﬁnal output. Note that, due to computational and memory limitation, it is not always possible to handle all the frames at

once. An alternative solution is to split the data into shorter video clips, and infer the results hierarchically or sequentially

for each batch. Table 4 lists the differences between the two processing modes.

2.2.3. Type of output

This criterion classiﬁes MOT methods into deterministic ones and probabilistic ones, depending on the randomness of

output. The difference between these two types of methods primarily results from the optimization methods adopted as

mentioned in Section 2.1.

Stochastic tracking. The output results of stochastic tracking vary from time to time. For example, in the case of

detection-free tracking, the bounding box results are different if we utilize particle ﬁlter for inference. The difference results

from the randomness of the generation of particles in the processing. Even in the case of detection-based tracking, some

剩余22页未读，继续阅读

TimeRiverForever

粉丝: 108

多目标跟踪：最新文献综述

PyPI下载最新dbnd-airflow-auto-tracking-0.32.6包

PyPI官方发布的dbnd-airflow-auto-tracking-0.41.3

Python库openedx-caliper-tracking-0.14.3的下载指南

object-tracking-particle-filter.zip_COMMAND filter_object tracki

Python库 | django-tracking-model-0.1.3.tar.gz

Python库 | ml-tracking-api-1.0.2.tar.gz

Python库 | dbnd-airflow-auto-tracking-0.38.4.tar.gz

Python库 | dbnd-airflow-auto-tracking-0.43.5.tar.gz

Python库 | dbnd-airflow-auto-tracking-0.28.24.tar.gz

Python库 | dbnd-airflow-auto-tracking-0.47.1.tar.gz

最新资源