3 Incremental Learning for Tracking
We present details of the proposed incremental learning al-
gorithm for object tracking in this section. First we propose
an efficient method that incrementally updates an eigenbasis
as new observations arrive, which is used to learn the ap-
pearance of the target while tracking progresses. Next we
describe our approach for drawing particles in the motion
parameter space and predicting the most likely object loca-
tion with the help of the learned appearance model. Collec-
tively, we show how these two modules work in tandem to
track objects well under varying conditions.
3.1 Incremental Update of Eigenbasis and Mean
The appearance of a target object may change drastically
due to intrinsic and extrinsic factors as discussed earlier.
Therefore, to produce a robust tracker, it is important to
adapt the appearance model online, while tracking, to re-
flect these changes. The appearance model we have chosen,
an eigenbasis, is typically learned off-line from a set of training images $\{I_1, \ldots, I_n\}$ by computing the eigenvectors $U$ of the sample covariance matrix $\frac{1}{n-1}\sum_{i=1}^{n}(I_i - \bar{I})(I_i - \bar{I})^\top$, where $\bar{I} = \frac{1}{n}\sum_{i=1}^{n} I_i$ is the sample mean of the training images. Equivalently, one can obtain $U$ by computing the singular value decomposition $U\Sigma V^\top$ of the centered data matrix $[(I_1 - \bar{I}) \cdots (I_n - \bar{I})]$, whose columns are the training images minus their mean.
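To make this concrete, here is a minimal NumPy sketch of the off-line computation just described; the names (batch_eigenbasis, images, n_basis) are ours, not from the paper.

```python
import numpy as np

def batch_eigenbasis(images, n_basis):
    """Mean and top eigenvectors of a d x n matrix with one image per column."""
    mean = images.mean(axis=1, keepdims=True)   # sample mean I-bar
    centered = images - mean                    # [(I_1 - I-bar) ... (I_n - I-bar)]
    # Thin SVD of the centered data matrix: the columns of U are the
    # eigenvectors of the sample covariance, and sigma**2 / (n - 1) are
    # the corresponding eigenvalues.
    U, sigma, _ = np.linalg.svd(centered, full_matrices=False)
    return mean, U[:, :n_basis], sigma[:n_basis]
```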
Adapting the appearance model to account for novel
views of the target can be thought of as retraining the eigen-
basis with an additional $m$ images $\{I_{n+1}, \ldots, I_{n+m}\}$, for some value of $m$. Naively, this update could be performed by computing the singular value decomposition $U'\Sigma'V'^\top$ of the augmented (centered) data matrix $[(I_1 - \bar{I}') \cdots (I_{n+m} - \bar{I}')]$, where $\bar{I}'$ is the average of all $n + m$ training images.
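In terms of the sketch above, this naive update amounts to keeping every image and recomputing from scratch:

```python
def naive_update(old_images, new_images, n_basis):
    # Retains all n + m images and redoes the full SVD, so storage and
    # computation grow with the length of the sequence.
    augmented = np.hstack([old_images, new_images])
    return batch_eigenbasis(augmented, n_basis)
```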
Unfortunately this approach is unsatisfactory for online
applications, like visual tracking, due to its storage and com-
putational requirements. First, the naive approach uses the
entire set of training images for each update. If an update is
made at each video frame, then the number of images which
must be retained grows linearly with the length of the se-
quence. Second, the cost of computing the mean and singu-
lar value decomposition grows with the number of images,
so the algorithm will run ever slower as time progresses. In-
stead, the requirements of our application dictate that any
algorithm for updating the mean and eigenbasis must have
storage and computational requirements that are constant,
regardless of the number of images seen so far.
Numerous more sophisticated algorithms have been developed to efficiently update an eigenbasis as more data arrive (Golub and Van Loan 1996; Hall et al. 1998; Levy and Lindenbaum 2000; Brand 2002). However, most methods assume the sample mean is fixed when updating the eigenbasis, or equivalently that the data are inherently zero-mean. Neither assumption is appropriate in our application.
An exception is the method by Hall et al. (2002), which
does consider the change of the mean as each new datum
arrives. Although similar to our (independently-developed)
algorithm, it lacks the forgetting factor, which hurts its suit-
ability for tracking, and has a greater computational cost.
(Both of these disadvantages are demonstrated quantita-
tively in Sect. 4.3.) Part of the additional complexity arises because Hall's algorithm is based on the notion of adding eigenspaces, which requires computing the eigenvalue decomposition of each block of new data as it arrives. In this respect our
algorithm is simpler, since it incorporates new data directly,
without the additional step.
Here we extend one of these efficient update proce-
dures—the Sequential Karhunen–Loeve (SKL) algorithm of
Levy and Lindenbaum (2000)—presenting a new incremen-
tal PCA algorithm that correctly updates the eigenbasis as
well as the mean, given one or more additional training data.
Our algorithm, a variation of which was first presented in
Lim et al. (2005), has also been applied to algorithms where
the subspace mean plays an important role. For example, it
can be applied to adaptively update the between-class and
within-class covariance matrices used in Fisher linear dis-
criminant analysis (Lin et al. 2005). We begin with a sum-
mary of the SKL algorithm, then describe our new incre-
mental PCA algorithm, and follow with a discussion of a
forgetting factor which can be used to down-weight the ef-
fect of earlier observations on the PCA model.
Putting aside for the moment the problem of the sample
mean, suppose we have a $d \times n$ data matrix $A = \{I_1, \ldots, I_n\}$ where each column $I_i$ is an observation (a $d$-dimensional image vector in this paper), for which we have already computed the singular value decomposition $A = U \Sigma V^\top$. When a $d \times m$ matrix $B$ of new observations is available, the goal is to efficiently compute the SVD of the concatenation of $A$ and $B$: $[A \; B] = U' \Sigma' V'^\top$. Letting $\tilde{B}$ be the component of $B$ orthogonal to $U$, we can express the concatenation of $A$ and $B$ in a partitioned form as follows:
$$[A \;\; B] = \begin{bmatrix} U & \tilde{B} \end{bmatrix} \begin{bmatrix} \Sigma & U^\top B \\ 0 & \tilde{B}^\top B \end{bmatrix} \begin{bmatrix} V & 0 \\ 0 & I \end{bmatrix}^\top. \tag{1}$$
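The following sketch checks the partitioned form (1) numerically; here $\tilde{B}$ is taken to be an orthonormal basis for the component of $B$ orthogonal to $U$, obtained by a QR decomposition of the residual (one common choice, though the text only requires orthogonality).

```python
rng = np.random.default_rng(0)
d, n, m = 40, 20, 5
A, B = rng.standard_normal((d, n)), rng.standard_normal((d, m))
U, sigma, Vt = np.linalg.svd(A, full_matrices=False)   # A = U Sigma V^T

proj = U.T @ B                              # coefficients of B in span(U)
B_tilde, _ = np.linalg.qr(B - U @ proj)     # orthonormal basis for the residual

k = sigma.size
middle = np.block([[np.diag(sigma), proj],
                   [np.zeros((m, k)), B_tilde.T @ B]])  # the matrix R below
right = np.block([[Vt.T, np.zeros((n, m))],
                  [np.zeros((m, k)), np.eye(m)]])       # [[V, 0], [0, I]]

assert np.allclose(np.hstack([A, B]),
                   np.hstack([U, B_tilde]) @ middle @ right.T)
```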
Let $R = \begin{bmatrix} \Sigma & U^\top B \\ 0 & \tilde{B}^\top B \end{bmatrix}$, which is a square matrix of size $k + m$, where $k$ is the number of singular values in $\Sigma$. The SVD of $R$, $R = \tilde{U} \tilde{\Sigma} \tilde{V}^\top$, can be computed in constant time regardless of $n$, the initial number of data. Now the SVD of $[A \; B]$ can be expressed as
$$[A \;\; B] = \left( \begin{bmatrix} U & \tilde{B} \end{bmatrix} \tilde{U} \right) \tilde{\Sigma} \left( \tilde{V}^\top \begin{bmatrix} V & 0 \\ 0 & I \end{bmatrix}^\top \right).$$
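Collecting these steps, a compact sketch of the resulting update, truncated to the leading n_basis components and omitting the mean update and forgetting factor introduced below:

```python
def skl_update(U, sigma, B, n_basis):
    """Fold a d x m block B of new observations into an existing SVD."""
    proj = U.T @ B
    B_tilde, _ = np.linalg.qr(B - U @ proj)
    k, m = sigma.size, B.shape[1]
    R = np.block([[np.diag(sigma), proj],
                  [np.zeros((m, k)), B_tilde.T @ B]])
    U_R, sigma_R, _ = np.linalg.svd(R)            # small (k+m) x (k+m) SVD
    U_new = np.hstack([U, B_tilde]) @ U_R         # U' = [U  B_tilde] U-tilde
    return U_new[:, :n_basis], sigma_R[:n_basis]  # Sigma' = Sigma-tilde; V' skipped
```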
Since an incremental PCA is only interested in computing $U'$ and $\Sigma'$, $V'$, whose size scales with the number of