稀疏编码追踪：笛卡尔积与岭回归增强

153 浏览量更新于2024-08-27 收藏 1.83MB PDF 举报

"基于产品稀疏编码的强大视觉跟踪" 本文介绍了一种创新的视觉跟踪算法，该算法利用了两个子码本的笛卡尔积来进行稀疏编码。在计算机视觉领域，视觉跟踪是关键任务之一，旨在在连续的视频帧中持续定位和识别目标对象。传统的稀疏编码方法通常面临计算复杂度高、跟踪稳定性差等问题。本文提出的算法则通过分解原始的稀疏编码问题为两个子问题，有效地解决了这些问题。首先，算法的核心在于使用两个子码本来构建一个更大的稀疏表示空间。这种做法显著地增加了稀疏表示的维度，但同时保持了较低的计算成本。这种方法允许算法在高维度空间中更精确地表示和区分目标对象与背景，从而提高了跟踪的准确性。其次，为了优化L1-范数最小化过程，作者引入了岭回归（Ridge Regression）技术。L1-范数最小化常用于稀疏编码中寻找最简表示，但其计算量较大。通过岭回归，算法能有效地剔除那些导致重构误差较大的“外在粒子”，即不相关的特征，减少了不必要的计算负担，提升了跟踪效率。接下来，算法将得到的高维稀疏表示输入到分类器中，如支持向量机（SVM）。分类器通过对候选区域进行评估，选择具有最大响应的区域作为目标对象的新位置。这种策略有助于在复杂的视觉环境中保持对目标的准确锁定，尤其是在目标出现遮挡、形变或光照变化等挑战性情况下。通过在一系列具有挑战性的基准图像序列上的实验，该算法的性能得到了定性和定量的评估。实验结果表明，提出的跟踪算法相对于其他最新算法表现出了优越的跟踪效果，证明了其在复杂视觉跟踪场景中的有效性。关键词：视觉跟踪、产品稀疏编码、L1-范数最小化、岭回归、支持向量机总结来说，这篇研究论文提出了一个基于产品稀疏编码的高效视觉跟踪算法，通过分解稀疏编码问题、利用岭回归优化以及结合高维稀疏表示和分类器，实现了在计算成本较低的情况下实现稳定且准确的目标跟踪。这种方法对于解决计算机视觉中的实时跟踪问题具有重要的理论和实践价值。

Pattern Recognition Letters 56 (2015) 52–59

Contents lists available at ScienceDirect

Pattern Recognition Letters

journal homepage: www.elsevier.com/locate/patrec

Robust visual tracking based on product sparse coding

✩

Huang Hong-tu

∗

, Bi Du-yan, Zha Yu-fei, Ma Shi-ping, Gao Shan, Liu Chang

Aeronautics and Astronautics Engineering College, Air Force Engineering University, Xi’an 710038, China

article info

Article history:

Received 28 September 2014

Available online 16 February 2015

Keywords:

Visual tracking

Product sparse coding

-norm minimization

Ridge regression

Support vector machine

abstract

In this paper, we propose a sparse coding tracking algorithm based on the Cartesian product of two sub-

codebooks. The original sparse coding problem is decomposed into two sub sparse coding problems. And the

dimension of sparse representation is intensively enlarged at a lower computational cost. Furthermore, in

order to reduce the number of L

-norm minimization, ridge regression is employed to exclude the substantive

outlying particles according to the reconstruction error. Finally the high-dimension sparse representation

is put into the classiﬁer and the candidate with the maximal response is considered as the target. Both

qualitative and quantitative evaluations on challenging benchmark image sequences demonstrate that the

proposed tracking algorithm performs favorably against several state-of-the-art algorithms.

1. Introduction

Visual tracking has long been playing a critical role in numerous

applications such as surveillance, military reconnaissance, motion

recognition and traﬃc monitoring, to name a few [1]. While much

progress has been made within the last decades, it still remains

challenging in many scenarios including pose variation, illumination

change, partial occlusion, motion blur, background clutter and so on.

In the past few years, variation and extension of L

-norm mini-

mization have been applied to many computer vision tasks, including

face recognition, image super-resolution, denoising, inpainting and

image classiﬁcation [2]. Inspired by the success of sparse representa-

tion in face recognition [3], many researchers develop a robust visual

tracking framework by casting the tracking as a sparse approxima-

tion on the codebook [4]. A thorough review can refer to [5].The

sparse coding visual tracking algorithms can be classiﬁed into two

categories, generative model and discriminative model. Both of them

require obtaining the sparse representation ﬁrstly. And the approach

to sparse representation is a L

-norm minimization problem, which

can be solved by homotopy method, gradient projection method, it-

erative shrinkage-thresholding method, interior-point method and

so on [6]. As we know sparse coding is a competitive method given

suﬃciently large codebooks [7]. However, sparse coding is compu-

tationally expensive and the computational cost increases sharply

with the size of the codebook. So its power is mostly limited by the

size of the codebook in practice, especially for discriminative sparse

✩

This paper has been recommended for acceptance by A. Fernandez-Caballero.

∗

Corresponding author. Tel.: +86 29 84787724; fax: +86 29 84787724.

E-mail address: huanghongtu@sina.cn (H. Hong-tu).

coding tracking algorithm. So many researchers have to make a trade-

off between the speed and the discriminative ability. Given a proper

computational cost, how to enlarge the codebook to improve the dis-

criminative power is urgent to be solved.

In this paper we propose a robust product sparse coding tracking

algorithm. And the codebook size is increased in product manner at a

lower computational cost than direct operation on the Cartesian prod-

uct of two sub-codebooks [7]. The original sparse coding problem is

decomposed into two sub sparse coding problems. Each codeword in

the codebook is divided into two equal parts. Then the sparse repre-

sentation of the candidate can be obtained on the two sub-codebooks

simultaneously. And the ﬁnal sparse representation can be calculated

via the product of the two obtained sparse coding coeﬃcients. Finally

the high-dimension sparse representation is input into the SVM clas-

siﬁer and the candidate with the maximal score is regarded as the

target. In order to reduce the number of L

-norm minimization, ridge

regression is adopted to exclude the candidates with big reconstruc-

tion error at a lower computational cost. After that, tracking is led by

the Bayesian state inference framework in which a particle ﬁlter is

used for propagating sample distributions over time. Numerous ex-

periments on various challenging sequences show that the proposed

algorithm performs favorably against state-of-the-art methods and

the tracker based on product sparse coding is superior to the original

sparse coding tracker under the same condition.

The rest of the paper is organized as follows. In Section 2,webe-

gin with summarizing the related work on sparse coding tracking. In

Section 3, we offer the details of the sparse representation based on

product sparse coding. Section 4 is the initialization and generaliza-

tion analysis of the SVM classiﬁer used in our paper. The integration

of our proposed model in particle ﬁlter framework for tacking is de-

scribed in Section 5. Qualitative and quantitative evaluations of our

http://dx.doi.org/10.1016/j.patrec.2015.01.014

下载后可阅读完整内容，剩余7页未读，立即下载

weixin_38654315

粉丝: 5
资源: 962

稀疏编码追踪：笛卡尔积与岭回归增强

云计算-基于视觉计算的运动目标跟踪及异常行为分析.pdf

通过时间集成框架进行可靠的在线视觉跟踪

基于深度学习的人脸跟踪自动初始化方法.pdf

基于稀疏编码的超分辨率算法c++代码

稀疏编码和参数化自编码器

用python实现稀疏编码模型

Matlab怎么提高稀疏编码器的准确率

用python实现稀疏编码器

matlab 稀疏编码代码

基于稀疏表示的人脸识别matlab

最新资源