Multi-Pose Multi-Target Tracking for Activity Understanding
Hamid Izadinia
University of Central Florida
Orlando, FL
izadinia@eecs.ucf.edu
Varun Ramakrishna, Kris M. Kitani, Daniel Huber
Carnegie Mellon University
Pittsburgh, PA
{vramakri, kkitani, dhuber}@cs.cmu.edu
Abstract
We evaluate the performance of a widely used tracking-by-detection and data association multi-target tracking pipeline applied to an activity-rich video dataset. In contrast to traditional work on multi-target pedestrian tracking, where people are largely assumed to be upright, we use an activity-rich dataset that includes a wide range of body poses derived from actions such as picking up an object, riding a bike, digging with a shovel, and sitting down. For each step of the tracking pipeline, we identify key limitations and offer practical modifications that enable robust multi-target tracking over a range of activities. We show that the use of multiple posture-specific detectors and an appearance-based data association post-processing step can generate the non-fragmented trajectories essential for holistic activity understanding.
1. Introduction
We explore the task of multi-target, multi-pose person tracking in activity-rich surveillance videos using the current tracking paradigm of tracking-by-detection and data association. Advances in robust category-specific object detectors [5, 6] have motivated the tracking-by-detection paradigm, in which robust detectors act as strong observation models in tracking frameworks. In particular, recent work has shown that a single coarse part-based model (e.g., 5 to 15 parts) [7, 10, 22] is well suited for detecting, representing, and tracking upright people. While these approaches are effective in urban scenarios, such as pedestrians walking on sidewalks or people in subway stations, difficulties arise when people perform other activities such as riding a bike, digging a hole, or pushing a cart. Although methods exist for full-body pose estimation [21, 8, 24], they often assume full body-part visibility. In this work, we target surveillance videos that contain a range of human activities beyond walking and standing. We evaluate the strengths and limitations of state-of-the-art multi-target tracking and offer practical modifications to improve performance.
Figure 1. DARPA Mind's Eye Y2 activity dataset (panels: SAFE HOUSE 1, SAFE HOUSE 2, ROAD 1, ROAD 2).
We proceed with our analysis by dividing the tracking pipeline into two stages: person detection and data association. In the person detection stage, we compare the results of standard pedestrian detectors against richer models that encode variations in pose. In particular, we compare four different deformable part models (DPMs) and show that training models explicitly for different postures improves performance. In the data association stage, we use a state-of-the-art multi-target data association framework [20] and examine how the choice of parameters affects the resulting trajectories. Specifically, we evaluate the tradeoff between the recall rate and the number of ID switches as a function of the parameters. To prevent frequent ID switching and to preserve longer trajectories, we propose an instance-specific trajectory merging process as a post-processing step that uses appearance-based cues to make associations over long periods of time.
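To make the two-stage structure concrete, the following is a minimal illustrative sketch of tracking-by-detection with frame-to-frame data association. It is not the paper's implementation: the paper uses a network-flow data association framework [20] and learned appearance regressors, whereas this sketch uses a simple greedy overlap-based association; all names and thresholds here are hypothetical, and the appearance-based merging step is omitted.

```python
def iou(a, b):
    # Intersection-over-union of two boxes given as (x1, y1, x2, y2).
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union else 0.0

def associate(frames, iou_thresh=0.3):
    """Greedily link per-frame detections into tracklets.

    frames: list of per-frame detection lists (detections from the
    posture-specific detectors, pooled per frame); each detection is a box.
    Returns a list of tracklets, each a list of (frame_index, box) pairs.
    """
    tracklets = []
    active = []  # tracklet indices that were extended in the previous frame
    for t, dets in enumerate(frames):
        next_active = []
        unmatched = list(range(len(dets)))
        for ti in active:
            last_box = tracklets[ti][-1][1]
            best, best_iou = None, iou_thresh
            for di in unmatched:
                score = iou(last_box, dets[di])
                if score > best_iou:
                    best, best_iou = di, score
            if best is not None:  # extend the tracklet with the best match
                tracklets[ti].append((t, dets[best]))
                unmatched.remove(best)
                next_active.append(ti)
        for di in unmatched:  # unmatched detections start new tracklets
            tracklets.append([(t, dets[di])])
            next_active.append(len(tracklets) - 1)
        active = next_active
    return tracklets
```

The fragmentation this sketch produces whenever a target is missed for even one frame is exactly what motivates the appearance-based trajectory merging post-processing step described above.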
The contributions of this paper are as follows: (1) a step-by-step analysis of detection-based data association tracking for activity-rich videos; (2) a multi-pose deformable parts model that allows for robust tracking over pose variations; and (3) long-term data association using target-specific appearance-based regressors.
Work on multi-pedestrian tracking is a significant field
978-1-4673-5052-5/12/$31.00 ©2012 IEEE