Fig. 1. Overview of the proposed tracking method.
minimization problem, the run time is determined by the product of the total number of PCG steps, accumulated over all iterations, and the cost of each PCG step. The total number of PCG iterations depends on the value of the regularization parameter λ. In the experiments with λ = 0.15, the total number of PCG steps is typically a few hundred. Within a PCG step, the most expensive operation is a matrix-vector product, which has O(d² + d × n) computational complexity, where d is the feature dimensionality and n is the number of templates.
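As a rough illustration (a sketch under stated assumptions, not the paper's implementation), the snippet below forms a dictionary of the kind used by ℓ1 trackers, with n target templates stacked next to d × d positive and negative trivial templates, and performs the matrix-vector products that dominate one PCG step; the sizes d and n and this dictionary layout are illustrative assumptions, shown as one way the O(d² + d × n) cost arises.

```python
import numpy as np

# Illustrative sizes (assumed, not from the paper).
d, n = 1024, 10                       # feature dimensionality, number of target templates

# Assumed dictionary layout: n target templates plus d x d positive and
# negative trivial templates, giving a d x (n + 2*d) matrix X.
T = np.random.randn(d, n)
X = np.hstack([T, np.eye(d), -np.eye(d)])

# The dominant work in one PCG step is a pair of matrix-vector products with X
# and its transpose, i.e. roughly d*(n + 2*d) = O(d^2 + d*n) multiply-adds each.
v = np.random.randn(n + 2 * d)
r = X @ v                             # d x (n + 2d) matrix times a length-(n + 2d) vector
g = X.T @ r                           # and the transpose product
```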
Motivated by the sparse signal recovery power of compressive sensing (CS), Li et al. [31] accelerated the ℓ1-norm minimization by reducing the feature dimensionality using a hash table or an RP that meets the restricted isometry property (RIP) [41]. Let P ∈ R^{d̃ × d} be the projection matrix; the coefficients α can then be computed by

α = arg min_α ‖Py − PXα‖₂² + λ‖α‖₁.    (5)
When we set d̃ ≪ d, the dimensionality of the ℓ1 minimization is significantly reduced, while the original high-dimensional y can still be fully recovered from the reduced Py.
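A minimal sketch of the reduced problem (5), assuming a dense Gaussian projection in place of the structurally random mapping and using scikit-learn's Lasso as a generic ℓ1-regularized least-squares solver; all sizes are illustrative.

```python
import numpy as np
from sklearn.linear_model import Lasso   # generic l1-regularized least-squares solver

# Illustrative sizes (assumed): original dimensionality d, reduced d_tilde << d,
# and n templates forming the dictionary X.
d, d_tilde, n = 4096, 64, 20
rng = np.random.default_rng(0)
X = rng.standard_normal((d, n))
y = X @ np.abs(rng.standard_normal(n)) + 0.01 * rng.standard_normal(d)

# Projection matrix P in R^{d_tilde x d}.  A dense Gaussian matrix satisfies the
# RIP with high probability; it stands in here for the structurally random mapping.
P = rng.standard_normal((d_tilde, d)) / np.sqrt(d_tilde)

# Reduced problem (5): alpha = argmin_alpha ||P y - P X alpha||_2^2 + lam*||alpha||_1.
# scikit-learn's Lasso minimizes (1/(2*m))*||b - A w||_2^2 + a*||w||_1 over m samples,
# so a = lam/(2*d_tilde) matches the objective up to that constant factor.
lam = 0.15
alpha = Lasso(alpha=lam / (2 * d_tilde), fit_intercept=False, max_iter=10000) \
            .fit(P @ X, P @ y).coef_
```

The only change relative to the original problem is that both y and X are multiplied by P before the solver is called, so the per-iteration cost scales with d̃ rather than d.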
To compute the coefficients shown in (2), we need the computationally expensive ℓ1-norm minimization. However, the particle weights defined in (3) are computed from a reconstruction error measured in the ℓ2-norm, which has the lower bound

‖y − Fα_F‖₂² ≥ ‖y − Fα̂_F‖₂²,  where  α̂_F = arg min_{α_F} ‖y − Fα_F‖₂².    (6)
Instead of reducing the computational complexity of the ℓ1-norm minimization, Mei et al. [30] proposed to reduce the number of ℓ1-norm minimizations by excluding unimportant particles using the reconstruction error bound computed via the fast ℓ2-norm minimization shown in (6).
The aforementioned methods employ sparse representation
to globally encode each target candidate through the target
templates. In the literature, there are also other kinds of methods [35], [42], which use local sparse representation
to model target appearance. These methods first construct a
dictionary from the local patches sampled from the training
images that contain the tracked target, and then use the
dictionary to encode local patches sampled from each target
candidate or template. The coding coefficients are used as
features to describe the appearance of the target candidate
or template. However, due to the locality of the sampling, these appearance
modeling methods have limited discriminative
ability. To overcome this disadvantage, Zhong et al. [43]
integrated both local and global sparse appearance models.
In [44] and [45], structural sparse appearance modeling
was proposed, which exploited the spatial layout of the
locally sampled patches to increase the discriminative ability.
In [46], a more sophisticated method was proposed, where
discriminative sparse coding was directly used to enhance the
discriminative power of the resulting coding coefficients.
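To make the local sparse appearance modeling concrete, the sketch below encodes the local patches of one candidate against a patch dictionary and uses the coefficients as the appearance feature; the patch size, dictionary size, and the orthogonal matching pursuit coder are assumptions chosen for illustration, not the specific choices of [35] or [42].

```python
import numpy as np
from sklearn.linear_model import orthogonal_mp   # sparse coder (assumed choice)

# Illustrative setup (assumed): 8x8 gray-level patches, flattened to 64-d vectors.
rng = np.random.default_rng(0)
patch_dim, dict_size, patches_per_candidate = 64, 100, 36

# Dictionary built from local patches sampled inside previous target regions
# (random data stands in for the sampled patches), with unit-norm columns.
D = rng.standard_normal((patch_dim, dict_size))
D /= np.linalg.norm(D, axis=0)

# Local patches sampled from one target candidate.
candidate_patches = rng.standard_normal((patch_dim, patches_per_candidate))

# Sparse-code every patch against the dictionary; the concatenated (or pooled)
# coefficients serve as the appearance feature of the candidate.
codes = orthogonal_mp(D, candidate_patches, n_nonzero_coefs=5)   # dict_size x patches
feature = codes.reshape(-1)
```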
III. PROPOSED METHOD
In this section, we present the proposed tracking method
based on structurally random mapping and WLS. In contrast to ℓ1 trackers which only use target templates, our proposed framework uses both target and background templates
to represent each candidate. When the total reconstruction
error is minimized, the target and the background templates
compete against each other in the linear representation. After
reducing the feature dimensionality using structurally random
mapping, we compute the representation coefficients by the
WLS technique. The reconstruction errors obtained by the
target and the background templates are used to discriminate
the target from its background. An overview of the proposed
tracking method is shown in Fig. 1.
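The sketch below captures this idea at a high level, under stated assumptions: a Gaussian projection stands in for the structurally random mapping, the WLS weights are left as an identity placeholder, and the candidate score is simply the difference between the background and target reconstruction errors; the paper's actual weighting and scoring rules are not reproduced here.

```python
import numpy as np

# High-level sketch; all names and sizes are illustrative assumptions.
rng = np.random.default_rng(0)
d, d_tilde, n_t, n_b = 4096, 64, 10, 40     # feature dim, reduced dim, #target / #background templates

T = rng.standard_normal((d, n_t))           # target templates
B = rng.standard_normal((d, n_b))           # background templates
y = rng.standard_normal(d)                  # feature vector of one candidate

# Dimensionality reduction; a Gaussian projection stands in for the paper's
# structurally random mapping.
P = rng.standard_normal((d_tilde, d)) / np.sqrt(d_tilde)
A = P @ np.hstack([T, B])                   # reduced joint dictionary
b = P @ y

# Weighted least squares: alpha = argmin_alpha ||W^(1/2) (b - A alpha)||_2^2.
# The weights w are an identity placeholder here; the paper defines its own.
w = np.ones(d_tilde)
sw = np.sqrt(w)
alpha, *_ = np.linalg.lstsq(A * sw[:, None], b * sw, rcond=None)

# Target and background templates compete in the joint representation; the
# candidate is scored by how much better the target part reconstructs it.
err_t = np.sum((b - (P @ T) @ alpha[:n_t]) ** 2)
err_b = np.sum((b - (P @ B) @ alpha[n_t:]) ** 2)
score = err_b - err_t                       # illustrative confidence, not the paper's rule
```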
A. Tracking Framework
The proposed method is implemented using a sequential importance sampling (also known as particle filter) framework [38], [39], which is a popular computational method to recursively approximate the posterior distribution of the state variables characterizing a dynamic system. It consists of two stages: prediction and updating. Let z_t and I_t be the state variables and the observation at time t, respectively. The posterior distribution of z_t given all the available observations I_{1:t−1} = {I_1, I_2, ..., I_{t−1}} up to time t − 1 can be predicted using the state transition model p(z_t | z_{t−1}) as

p(z_t | I_{1:t−1}) = ∫ p(z_t | z_{t−1}) p(z_{t−1} | I_{1:t−1}) dz_{t−1}.    (7)
At time t, the observation I_t is available, and the posterior distribution of z_t is updated using the Bayes rule as

p(z_t | I_{1:t}) = p(I_t | z_t) p(z_t | I_{1:t−1}) / p(I_t | I_{1:t−1}).    (8)
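A compact sketch of the prediction step (7) and the update step (8) in a sequential importance sampling loop; the Gaussian random-walk transition model and the placeholder likelihood are illustrative assumptions, since the tracker's actual observation model is built from template reconstruction errors.

```python
import numpy as np

rng = np.random.default_rng(0)
N, state_dim = 300, 6                    # number of particles and state dimension (illustrative)

particles = rng.standard_normal((N, state_dim))    # {z_t^i}
weights = np.full(N, 1.0 / N)                      # {w_t^i}

def likelihood(z, I_t):
    """Placeholder for p(I_t | z_t); the tracker evaluates reconstruction errors here."""
    return float(np.exp(-0.5 * np.sum(z ** 2)))

def sis_step(particles, weights, I_t, sigma=0.05):
    # Prediction (7): draw z_t^i from p(z_t | z_{t-1}^i), here a Gaussian random walk.
    particles = particles + sigma * rng.standard_normal(particles.shape)
    # Update (8): reweight each particle by the observation likelihood and normalize.
    weights = weights * np.array([likelihood(z, I_t) for z in particles])
    weights = weights / weights.sum()
    return particles, weights
```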
Using the sequential importance sampling technique, the posterior distribution p(z_t | I_{1:t}) is approximated by a set of N weighted samples (also called particles) {z_t^i, w_t^i}_{i=1,...,N}, where w_t^i are the importance weights of the particles z_t^i. Let q(z_t | I_{1:t}, z_{1:t−1}) be the importance distribution from which