Application of an EMD-Based Convex Hull Model to Visual Tracking


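This paper proposes a visual tracking method based on the convex hull and the Earth Mover's Distance (EMD). It targets tracking under significant appearance changes caused by partial occlusion, illumination variation, fast motion and similar factors. Conventional appearance models usually take the tracking results from previous frames as target templates and build the appearance model as a linear combination of those templates, which is not robust when the appearance changes drastically. The authors propose a tracking algorithm in a particle filter framework with a new appearance model: a target candidate is represented by a convex combination of a set of target templates (that is, by a point in the convex hull of the templates), and the EMD measures the distance between the candidate and the templates.

A particle filter is a recursive Bayesian filter that approximates the posterior distribution of the target state with a set of random samples (particles). Each particle represents one possible target state. The EMD-based distance is used to update the particle weights, so that particles that match the templates more closely receive higher weights. The EMD itself measures the difference between two distributions as the minimum cost of transforming one into the other.

By combining a geometric representation (the convex hull) with a distribution distance (the EMD), the method aims to keep tracking accurate and stable under illumination changes, occlusion and fast motion.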
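
The weighting step described in this summary can be sketched as follows. This is a minimal illustration, not the paper's implementation: the helper names (`emd_1d`, `weights_from_distances`, `systematic_resample`), the exponential likelihood with its `sharpness` parameter, and the restriction to 1-D histograms are all assumptions. For two 1-D histograms of equal total mass with unit distance between adjacent bins, the EMD equals the sum of absolute differences between their cumulative sums, which is what `emd_1d` computes.

```python
import math
import random


def emd_1d(p, q):
    """EMD between two 1-D histograms (unit distance between adjacent bins)."""
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    mass_p, mass_q = sum(p), sum(q)
    if mass_p <= 0 or mass_q <= 0:
        raise ValueError("histograms must have positive total mass")
    carried = 0.0  # surplus mass that still has to be moved to later bins
    work = 0.0
    for a, b in zip(p, q):
        carried += a / mass_p - b / mass_q
        work += abs(carried)
    return work


def weights_from_distances(distances, sharpness=5.0):
    """Assumed likelihood exp(-sharpness * d), normalised to sum to 1."""
    scores = [math.exp(-sharpness * d) for d in distances]
    total = sum(scores)
    return [s / total for s in scores]


def systematic_resample(weights, rng):
    """Return particle indices drawn in proportion to their weights."""
    n = len(weights)
    start = rng.random() / n  # single random offset shared by all draws
    indices = []
    i = 0
    cumulative = weights[0]
    for k in range(n):
        position = start + k / n
        while position >= cumulative and i < n - 1:
            i += 1
            cumulative += weights[i]
        indices.append(i)
    return indices


# Toy data: one template histogram and three candidate histograms.
template = [4, 3, 2, 1]
candidates = [[4, 3, 2, 1], [1, 2, 3, 4], [3, 3, 2, 2]]
distances = [emd_1d(c, template) for c in candidates]
weights = weights_from_distances(distances)
survivors = systematic_resample(weights, random.Random(0))
```

Candidates whose histogram is closer to the template get larger weights and are therefore more likely to survive resampling. A real tracker would compute the distance on image features rather than on hand-written histograms.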


CONVEX HULL FOR VISUAL TRACKING WITH EMD

Jun Wang, Yuanyun Wang, Chengzhi Deng*, Shengqian Wang, Huasheng Zhu

1 Jiangxi Province Key Laboratory of Water Information Cooperative Sensing and Intelligent Processing, Nanchang Institute of Technology, Nanchang 330099, China

2 School of Information Engineering, Nanchang Institute of Technology, Nanchang 330099, China

3 Research Laboratory of Cooperative Sensing and Advanced Computing Techniques, Nanchang Institute of Technology, Nanchang 330099, China

ABSTRACT

Developing an effective target appearance model is a challenging task due to factors such as partial occlusion, illumination variations and fast motion. Existing appearance models usually use the tracking results from previous frames as target templates, upon which the target appearance model is built by linear combinations of the templates. With this kind of representation, visual tracking is not robust when drastic appearance variations occur. We propose a simple but effective tracking algorithm with a novel appearance model in a particle filter framework. A target candidate is represented by a convex combination of a set of target templates. Additionally, the distance between a target candidate and the templates is measured using the Earth Mover's Distance (EMD). Experimental results on challenging video sequences against state-of-the-art algorithms demonstrate the robustness and effectiveness of the proposed tracking algorithm.

Index Terms: Visual tracking, convex hull, particle filter, Earth Mover's Distance, appearance model

1. Introduction

Visual tracking aims to continually locate a target across a video sequence. It has a wide range of applications such as human-computer interaction, visual surveillance and video retrieval. Although much progress has been made in the past decades [1], visual tracking remains a challenging task due to factors such as illumination variations, partial occlusion, background clutter, motion blur and out-of-plane rotation. A robust appearance model, one that tolerates significant appearance variations, is important for tracking precision and success rate. Based on the appearance model, tracking algorithms can be categorized as either generative [2, 3, 4, 5, 6] or discriminative [7, 8, 9, 10, 11].

*Corresponding author.

Generally speaking, generative tracking algorithms learn a target appearance model to represent the target and search for the most similar image region in each frame using the learnt appearance model. Discriminative tracking algorithms, by contrast, treat visual tracking as a binary classification problem: a learnt classifier is used to distinguish the target from the surrounding background. Here, we briefly review some typical tracking algorithms that relate to our work.
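
The generative search step can be illustrated with a small sketch. It is only a toy under stated assumptions: the frame and template are grayscale values in nested lists, similarity is the sum of squared differences (SSD), the search is an exhaustive sliding window, and the function names `ssd` and `find_best_match` are hypothetical. None of the cited trackers is claimed to work exactly this way.

```python
def ssd(patch, template):
    """Sum of squared differences between two equally sized patches."""
    return sum(
        (p - t) ** 2
        for patch_row, template_row in zip(patch, template)
        for p, t in zip(patch_row, template_row)
    )


def find_best_match(frame, template):
    """Slide the template over the frame; return ((row, col), score) of the best fit."""
    height, width = len(template), len(template[0])
    best_position, best_score = None, float("inf")
    for row in range(len(frame) - height + 1):
        for col in range(len(frame[0]) - width + 1):
            # Cut out the window whose top-left corner is (row, col).
            patch = [r[col:col + width] for r in frame[row:row + height]]
            score = ssd(patch, template)
            if score < best_score:
                best_position, best_score = (row, col), score
    return best_position, best_score


frame = [
    [0, 0, 0, 0, 0],
    [0, 0, 9, 8, 0],
    [0, 0, 7, 9, 0],
    [0, 0, 0, 0, 0],
]
template = [[9, 8], [7, 9]]
position, score = find_best_match(frame, template)
```

A discriminative tracker would replace the SSD score with the output of a classifier trained to separate target patches from background patches.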

The IVT algorithm [2] incrementally learns a subspace model based on principal component analysis (PCA) to represent a target candidate. This appearance model is robust to illumination variations but sensitive to background clutter. In [3], a target candidate is divided into multiple non-overlapping image patches, each described by a histogram. Because a fixed target template is used, the drift problem is alleviated; however, the method is sensitive to illumination variations. Kwon et al. [6] sample a set of trackers to handle significant appearance and motion variations. Each is a basic tracker that is robust to one particular appearance or motion variation.
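
The patch-wise histogram description mentioned for [3] can be sketched as below. This is a hedged illustration only: the regular grid, the intensity range of 0 to 255, the bin mapping and the function name `patch_histograms` are assumptions, not details taken from [3].

```python
def patch_histograms(image, grid_rows, grid_cols, bins, max_value=256):
    """Split an image into a grid of non-overlapping patches and
    return one intensity histogram per patch (row-major order).

    Rows or columns left over when the size is not divisible by the
    grid are ignored in this sketch.
    """
    height, width = len(image), len(image[0])
    patch_h, patch_w = height // grid_rows, width // grid_cols
    histograms = []
    for gr in range(grid_rows):
        for gc in range(grid_cols):
            hist = [0] * bins
            for r in range(gr * patch_h, (gr + 1) * patch_h):
                for c in range(gc * patch_w, (gc + 1) * patch_w):
                    # Map an intensity in [0, max_value) to a bin index.
                    hist[image[r][c] * bins // max_value] += 1
            histograms.append(hist)
    return histograms


image = [
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [0, 255, 100, 100],
    [255, 0, 200, 200],
]
hists = patch_histograms(image, grid_rows=2, grid_cols=2, bins=2)
# hists == [[4, 0], [0, 4], [2, 2], [2, 2]]
```

Because each patch keeps its own histogram, an occluded patch changes only its own entry, while the other patches can still match the template.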

Sparse representation methods [12] have been applied to visual tracking [13, 14, 15, 16]. In the L1 algorithm [13], a target candidate is jointly represented by target templates and trivial templates. The target templates represent the foreground target, and the trivial templates describe occlusions. Based on sparse representation techniques, Zhong et al. [15] propose a robust tracking algorithm using a sparsity-based collaborative model. Recently, Zhang et al. [16] proposed an effective appearance model based on structured sparse representation.
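
The idea of representing a candidate jointly by target templates and trivial (identity) templates can be sketched with a small L1-regularised least-squares solver. This is a simplified illustration, not the formulation of [13]: it uses signed coefficients, plain iterative soft-thresholding (ISTA), a single penalty for all coefficients, and hypothetical names (`soft_threshold`, `sparse_code`, `penalty`).

```python
def soft_threshold(value, threshold):
    """Shrink a value towards zero by the given threshold."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sparse_code(candidate, templates, penalty=0.05, iterations=2000):
    """Minimise 0.5*||y - T a - e||^2 + penalty*(||a||_1 + ||e||_1) with ISTA.

    `a` weights the target templates; `e` weights the trivial templates
    (one per pixel), so it absorbs pixel-wise occlusion.
    """
    dim, n_t = len(candidate), len(templates)
    coeffs = [0.0] * (n_t + dim)
    # The squared Frobenius norm of [T, I] upper-bounds the Lipschitz constant.
    step = 1.0 / (sum(v * v for t in templates for v in t) + dim)
    for _ in range(iterations):
        recon = [
            sum(coeffs[k] * templates[k][i] for k in range(n_t)) + coeffs[n_t + i]
            for i in range(dim)
        ]
        resid = [recon[i] - candidate[i] for i in range(dim)]
        grad = [sum(t[i] * resid[i] for i in range(dim)) for t in templates] + resid
        coeffs = [
            soft_threshold(c - step * g, step * penalty)
            for c, g in zip(coeffs, grad)
        ]
    return coeffs[:n_t], coeffs[n_t:]


# One flat template; the candidate matches it except for a corrupted last pixel.
templates = [[0.5, 0.5, 0.5, 0.5]]
candidate = [0.5, 0.5, 0.5, 1.5]
template_coeffs, occlusion = sparse_code(candidate, templates)
```

In this toy case the template coefficient stays close to 1 and only the trivial coefficient of the corrupted pixel is non-zero, which is the behaviour the joint representation is meant to produce under partial occlusion.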

In generative tracking algorithms, a target candidate is usually represented by a linear combination of templates or of atoms in a dictionary. The templates or dictionary are composed of representative tracking results from previous frames of the video sequence. When drastic appearance variations occur, this kind of representation is not robust to partial occlusion, illumination variation, etc. Inspired by the convex hull technique applied in face recognition [17], we propose a novel visual tracking algorithm (referred to as CHT). In this work, a target candidate is represented by a convex combination of a set of target templates. Evaluating the observation likelihood is another important issue: we evaluate the likelihood of a target candidate based on the Earth Mover's Distance (EMD) [18] between the target candidate and the target templates.
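
A convex combination restricts the template coefficients to be non-negative and to sum to one, so the best approximation of a candidate is its projection onto the convex hull of the templates. The sketch below computes such coefficients with projected gradient descent. It is an assumption-laden illustration: this excerpt does not show how CHT actually solves for the coefficients, and the function names, step size rule and iteration count are hypothetical.

```python
import math


def project_to_simplex(v):
    """Euclidean projection of v onto {a : a_i >= 0, sum(a) = 1}."""
    ordered = sorted(v, reverse=True)
    running_sum, shift = 0.0, 0.0
    for count, value in enumerate(ordered, start=1):
        running_sum += value
        candidate_shift = (running_sum - 1.0) / count
        if value - candidate_shift > 0:
            shift = candidate_shift
    return [max(x - shift, 0.0) for x in v]


def convex_combination(candidate, templates, iterations=500):
    """Return (coefficients, distance) of the point in the templates'
    convex hull that is closest to the candidate (Euclidean distance)."""
    n, dim = len(templates), len(candidate)
    # Conservative step: 1 / (sum of squared template entries).
    step = 1.0 / sum(x * x for t in templates for x in t)
    coeffs = [1.0 / n] * n

    def reconstruct(a):
        return [sum(a[k] * templates[k][i] for k in range(n)) for i in range(dim)]

    for _ in range(iterations):
        recon = reconstruct(coeffs)
        resid = [recon[i] - candidate[i] for i in range(dim)]
        grad = [sum(t[i] * resid[i] for i in range(dim)) for t in templates]
        coeffs = project_to_simplex([c - step * g for c, g in zip(coeffs, grad)])

    recon = reconstruct(coeffs)
    distance = math.sqrt(sum((r - y) ** 2 for r, y in zip(recon, candidate)))
    return coeffs, distance


templates = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
inside, inside_dist = convex_combination([0.5, 0.5, 0.0], templates)
outside, outside_dist = convex_combination([2.0, 0.0, 0.0], templates)
```

A candidate that lies inside the hull is reproduced almost exactly, while a candidate outside it is mapped to the nearest hull point. The paper measures candidate-to-template distance with the EMD rather than the Euclidean distance used in this sketch.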

The remainder of this paper is organized as follows. Section 2 presents the proposed visual tracking algorithm.

978-1-5090-0654-0/16/$31.00 2016 IEEE, ICALIP 2016, p. 433
