深度学习下Metropolis-Hasting重采样法的在线目标跟踪

89 浏览量更新于2024-08-30 收藏 765KB PDF 举报

在线目标跟踪是计算机视觉领域中的一个重要课题，尤其是深度学习策略在解决视觉追踪中的复杂问题上展现出了显著的效果。然而，如何在非线性追踪框架中有效地应用深度学习模型，如卷积神经网络（CNN），仍然是一个挑战。本文的主要贡献在于提出了一个结合CNN的在线目标跟踪方法，特别针对直接受整合可能导致的过拟合问题。论文标题"Online Object Tracking Based on CNN with Metropolis-Hasting Re-Sampling"聚焦于研究一种新颖的策略，即通过构建一个基于CNN的自适应外观模型来解决在线追踪问题。这个模型能够在追踪过程中动态生成更多可靠的学习数据，从而提升整个追踪系统的性能。传统的CNN模型可能在处理实时场景中的目标变化时，由于数据量有限或者模型泛化能力不足而出现过拟合现象，因此，论文的核心创新点在于提出了一种Metropolis-Hastings重采样算法，用于重塑粒子分布并增强模型的鲁棒性。 Metropolis-Hastings算法是一种经典的马尔可夫链蒙特卡洛（MCMC）方法，它被用来改进粒子滤波器在在线追踪中的样本分布。通过这种方式，算法能够在每次迭代中根据当前观测信息调整模型参数，避免了直接将CNN模型固定不变导致的适应性不足。这种方法通过模拟退火的思想，能够在有限的训练数据下不断优化模型，确保模型能够随着目标对象的移动和环境变化而动态调整。具体步骤包括：首先，使用训练数据集初始化CNN模型，然后在追踪过程中，每当接收到新的观测数据，模型会利用Metropolis-Hastings算法进行重采样，筛选出最有可能代表当前目标状态的粒子。这些粒子将作为下一轮学习的输入，以生成更准确的特征表示。最后，通过这些更新后的粒子，网络进行微调，进一步优化其对目标外观的识别能力，提高追踪的稳定性和精度。这篇论文提供了一个有效的在线目标追踪框架，通过结合CNN和Metropolis-Hastings重采样技术，实现了在非线性追踪场景下深度学习模型的有效应用。这种动态学习和调整策略不仅有助于解决过拟合问题，还提升了追踪系统在复杂视觉环境下的鲁棒性和准确性。这为未来的研究者提供了宝贵的经验，特别是在实时视频监控、机器人导航等需要高效目标跟踪的应用领域。

Online Object Tracking Based on CNN with

Metropolis-Hasting Re-Sampling

Xiangzeng Zhou and Lei Xie

∗

School of Computer Science

Northwestern Polythechnical University

Xi’an, P. R. China

xenuts@gmail.com, lxie@nwpu.edu.cn

Peng Zhang

∗

and Yanning Zhang

School of Computer Science

Northwestern Polythechnical University

Xi’an, P. R. China

{zh0036ng, ynzhang}@nwpu.edu.cn

ABSTRACT

Tracking-by-learning strategies have been eﬀective in solv-

ing many challenging problems in visual tracking, in which

the learning sample generation and labeling play important

roles for ﬁnal performance. Since the concern of deep learn-

ing based approaches has shown an impressive performance

in diﬀerent vision tasks, how to properly apply the learning

model, such as CNN, to an online tracking framework is still

challenging. In this paper, to overcome the overﬁtting prob-

lem caused by straight-forward incorporation, we propose

an online tracking framework by constructing a CNN based

adaptive appearance model to generate more reliable train-

ing data over time. With a reformative Metropolis-Hastings

re-sampling scheme to reshape particles for a better state

posterior representation during online learning, the proposed

tracking outperforms most of the state-of-art trackers on

challenging benchmark video sequences.

Categories and Subject Descriptors

I.4.8 [Image Processing and Computer Vision]: Scene

Analysis—Tracking

General Terms

Algorithm, Theory

Keywords

Object tracking, CNN, Metropolis-Hastings, Re-sampling

1. INTRODUCTION

Learning sample quality is an essential factor to robust on-

line tracking, but this task is not easy because it is hard to

manually intervene the sample generation and labeling when

tracking is on-the-ﬂy. Although diﬀerent tracking strategies

have tried various types of traditional models for sample gen-

eration[15], the descriptive capability of those online sample

∗

Corresponding author.

Permission to make digital or hard copies of all or part of this work for personal or

classroom use is granted without fee provided that copies are not made or distributed

for proﬁt or commercial advantage and that copies bear this notice and the full cita-

tion on the ﬁrst page. Copyrights for components of this work owned by others than

ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or re-

publish, to post on servers or to redistribute to lists, requires prior speciﬁc permission

and/or a fee. Request permissions from Permissions@acm.org.

MM’15, October 26–30, 2015, Brisbane, Australia.

 2015 ACM. ISBN 978-1-4503-3459-4/15/10 ...$15.00.

DOI: http://dx.doi.org/10.1145/2733373.2806307.

is still far from suﬃcient for object characteristic represen-

tation.

In order to exploit more descriptive training samples, nowa-

days, deep learning models, e.g. convolutional neural net-

work (CNN) [16], have been successfully applied in a va-

riety of audio and visual tasks such as speech recognition

and image classiﬁcation, and obtain a remarkable progress.

But due to the requirement of a large number of training

data and high computational cost, most of those studies

approached their tasks with oﬀ-line learning process as pre-

sented in some recently proposed works [14, 11, 7]. Wang

et al. [14] proposed an online tracking strategy based on a

compact image representation learned from an oﬀ-line pre-

trained deep neural network which requires large amounts

of auxiliary images. Similarly, Hong et al. [7] carried out the

learning of discriminative saliency using a CNN, but still de-

manded a pre-trained model. Diﬀerent with [14] and [7], Li

et al. [11] proposed a variation of CNN with truncated struc-

tural loss to construct an online tracker and showed promis-

ing performance. But it mainly focused on model reforming

of CNN for online learning, and the sample generation prob-

lem is not addressed, which may lead to tracking failure in

complicated scenarios. Thus, how to utilize the advantage

of deep learning to generate more representative samples is

a challenging problem in online tracking tasks, and this is

also a motivation of this study.

Sample labeling is another challenge to properly utilize a

CNN model as learning strategy for online tracking. This

is because CNN is prone to overﬁtting to recent samples

and is sensitive to mislabeled samples. Typically, a particle

ﬁlter is used for eﬃciently conducting online object track-

ing by simulating object state’s posterior with a ﬁnite set

of weighted particles. However, it is diﬃcult for the parti-

cles being used to carry out a self-repair without any prior

knowledge when a speciﬁc error pattern arises. Such type

of error may be caused by an incorrect object’s interference

due to dramatic appearance change or overﬁtting problem

(e.g. CNN based appearance model). Therefore, an eﬀective

re-sampling process over particle ﬁlter may beneﬁt the label

assignment, providing more reliable labeled samples for the

learning of CNN model. This is another motivation leading

to this study.

In this work, we propose a robust online tracker by ex-

ploiting the strong learning capability of a CNN model with

particle ﬁltering framework. An overview of our tracking

framework is shown in Fig. 1. The contributions of the pro-

posed work are three folds. Firstly, we carry out an at-

tempt by introducing a single convolutional neural network

1163

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38739837

粉丝: 2
资源: 912

深度学习下Metropolis-Hasting重采样法的在线目标跟踪

Gait-Tracking-With-x-IMU-master.rar_Gait Tracking_IMU_去除重力

Online Visual Tracking by Huchuan Lu-June 1, 2019.epub

UAV-auto-navigation-and-object-tracking-based-on-RL-main

Online Tracking

分别详细介绍以下的GCC编译选项的功能原理： -fno-var-tracking-assignments-toggle -fno-var-tracking-uninit -fvariable-expansion-in-unroller -fno-tree-partial-pre -funconstrained-commons -fno-unroll-all-loops -funroll-loops -funsafe-math-optimizations -fno-vpt

请解决以下代码的问题ui_mainwindow.h:614:10: note: variable tracking size limit exceeded with -fvar-tracking-assignments, retrying without void setupUi(QMainWindow *MainWindow)

请帮我找一些关于“基于FANUC机器人的2D视觉追踪系统”相关内容的论文

介绍Online Tracking

In Defense of Color-based Model-free Tracking

event-based vision for object tracking

最新资源