A Visual Tracking Method Based on Random Walks on Graph Models

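This paper proposes a new visual tracking method based on random walks on graph models. Superpixels are represented as nodes and the relationships between them as edges. Building on Markov random walk theory, two graph models are constructed: one searches globally for candidate nodes with similar features, and the other models the spatio-temporal coherence between consecutive frames. Finally, a structural model combines the appearance similarity obtained from the random walks with the internal spatial layout of the different target parts to generate the final confidence map.

In the research paper "Visual Tracking via Random Walks on Graph Model", the authors Xiaoli Li, Zhifeng Han, Lijun Wang, and Huchuan Lu propose a new graph-based framework for visual tracking. They cast visual tracking as a random walk on a graph model whose nodes are the superpixels of the image and whose edges encode the relationships between those superpixels. First, they build an ergodic Markov chain whose purpose is to search globally for nodes whose features are similar to those of the template nodes (the initial state of the target object). Ergodicity guarantees that every node can be reached from every other node and that the walk converges to a unique stationary distribution, so nodes matching the target features can be found wherever the walk starts. Second, to capture the continuity of the target over time, the authors introduce an absorbing Markov chain. This chain models the motion of the target from one frame to the next, using the target state of the previous frame as the absorbing states, which keeps tracking stable and strengthens temporal coherence.

Next, they propose a structural model that combines the results of the two Markov chains. It considers both the appearance similarity computed by the random walks and the spatial layout of the different target parts. The combined analysis yields a confidence map giving the probability that each superpixel belongs to the target, which allows more accurate target localization and tracking. The authors evaluate the proposed Markov chain methods and the structural model both qualitatively and quantitatively, and report stable and accurate tracking even under complex scene changes and occlusion. By combining global search with spatio-temporal coherence through graph models and random walk theory, the method improves tracking performance.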

This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.

LI et al.: VISUAL TRACKING VIA RANDOM WALKS ON GRAPH MODEL 3

Fig. 2. Pipeline of our method. (a) Input frame at time t. (b) Search region after oversegmentation. (c) Proposed random walks on two graph models. (d) Proposed structural model. (e) Final confidence map.

states and absorbing states. Given an absorbing chain with $M_a$ absorbing states and $M_t$ transient states, we renumber the states so that the transient states come first. Then the transition matrix $P$ has the following canonical form:

$$P = \begin{bmatrix} Q & R \\ 0 & I \end{bmatrix} \tag{5}$$

where $Q \in [0, 1]^{M_t \times M_t}$ contains the transition probability between any pair of the transient states; $R \in [0, 1]^{M_t \times M_a}$ contains the probability of reaching any absorbing state from any transient state; $0$ is the $M_a \times M_t$ zero matrix; and $I$ is the $M_a \times M_a$ identity matrix. Starting at any transient state, the random walk process on an absorbing Markov chain will always be absorbed. The fundamental matrix of an absorbing Markov chain is defined as

$$N = (I - Q)^{-1} \tag{6}$$

where $N = [N_{ij}]_{M_t \times M_t}$; $N_{ij}$ denotes the expected number of times that the process stays in transient state $S_j$ after starting from transient state $S_i$; and $\sum_j N_{ij}$ is the absorbing time of node $i$, i.e., the expected number of times before the chain is absorbed (into any absorbing state), given that the chain starts in transient state $S_i$. The absorbing time for every transient state can then be computed by

$$\tau = N \mathbf{1} \tag{7}$$

where $\tau = [\tau_1, \tau_2, \ldots, \tau_{M_t}]^{\top}$; $\tau_i$ is the absorbing time for transient state $S_i$; $\mathbf{1}$ is a column vector whose entries are all ones.
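To make (6) and (7) concrete, the following sketch computes the fundamental matrix and the absorbing times for a toy chain. It is an illustration only, not the implementation used in this paper: the helper names (`solve_linear_system`, `fundamental_matrix`, `absorbing_times`) and the example chain (a symmetric walk on four states whose two end states are absorbing) are assumptions introduced here. Exact fractions are used so that the result can be checked by hand.

```python
from fractions import Fraction


def solve_linear_system(a, b):
    """Solve a x = b by Gauss-Jordan elimination with exact fractions."""
    n = len(a)
    # Augmented matrix [a | b]; built fresh so the inputs stay untouched.
    aug = [[Fraction(v) for v in row] + [Fraction(rhs)]
           for row, rhs in zip(a, b)]
    for col in range(n):
        # Pick a row with a non-zero pivot in this column and swap it up.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Scale the pivot row so that the pivot equals one.
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def identity_minus(q):
    """Return the matrix I - Q for a square matrix Q."""
    n = len(q)
    return [[(1 if i == j else 0) - Fraction(q[i][j]) for j in range(n)]
            for i in range(n)]


def fundamental_matrix(q):
    """Return N = (I - Q)^{-1}, computed one column at a time."""
    n = len(q)
    a = identity_minus(q)
    columns = [solve_linear_system(a, [1 if r == c else 0 for r in range(n)])
               for c in range(n)]
    # columns[c][r] holds N[r][c], so transpose back to row-major order.
    return [[columns[c][r] for c in range(n)] for r in range(n)]


def absorbing_times(q):
    """Return tau = N 1 by solving (I - Q) tau = 1 directly."""
    return solve_linear_system(identity_minus(q), [1] * len(q))


# Toy chain: states 0..3, states 0 and 3 absorbing, states 1 and 2 transient.
# From a transient state the walker moves left or right with probability 1/2.
half = Fraction(1, 2)
Q = [[0, half],
     [half, 0]]
N = fundamental_matrix(Q)
tau = absorbing_times(Q)
```

For this toy chain, `N` equals [[4/3, 2/3], [2/3, 4/3]] and `tau` equals [2, 2], so each row sum of `N` matches the corresponding absorbing time, as (7) states. Solving the linear system for `tau` avoids forming the inverse explicitly, which is usually preferable for larger graphs.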

III. TRACKING VIA RANDOM WALKS ON GRAPH MODELS

The pipeline of our method is illustrated in Fig. 2. When a new frame arrives, we first segment the search region¹ into n superpixels [see Fig. 2(b)]. Then, we construct two graph models and perform Markov random walks on them to compute a confidence map which measures the probability of each superpixel belonging to the target area. In order to make the confidence map more robust, a structural model is constructed to exploit structural information between object parts. Finally, the target is located by maximum a posteriori estimate.

Fig. 3. Graph model $G^e(V^e, E^e)$ based on ergodic Markov chain. Each rectangle represents a node. The green candidate nodes and white candidate nodes denote target nodes and background nodes, respectively. The template nodes are superpixels in the target region of the first frame.

¹ The search region is a square area centered at $X_{t-1}$, the target location of the last frame. The side length of the search region is set to $\lambda \cdot (S(X_{t-1}))^{1/2}$, where $S(X_{t-1})$ represents the area of target area $X_{t-1}$. The parameter $\lambda$ controls the size of this surrounding region and is set to two in experiments.
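The search-region rule described above (a square centered at the previous target location, with side length λ times the square root of the target area) can be sketched as follows. This is a minimal illustration under stated assumptions: the target is taken to be an axis-aligned box so that its area is width times height, coordinates are in pixels, and the function name `search_region` and its parameters are hypothetical.

```python
import math


def search_region(center_x, center_y, target_width, target_height, lam=2.0):
    """Return (left, top, side) of the square search region.

    The side length is lam * sqrt(target area); lam = 2 is the value
    reported for the experiments. The box-shaped target is an assumption.
    """
    area = target_width * target_height
    side = lam * math.sqrt(area)
    # The square is centered on the target location of the last frame.
    left = center_x - side / 2.0
    top = center_y - side / 2.0
    return left, top, side


# A 40 x 10 pixel target centered at (100, 80) in the previous frame.
region = search_region(100, 80, 40, 10)
```

Here the target area is 400, so the side length is 40 and `region` is `(80.0, 60.0, 40.0)`. Clipping the square to the image boundary is left out and would be needed in practice.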

A. Graph Construction

Two kinds of directed graphs, $G^e(V^e, E^e)$ and $G^a(V^a, E^a)$, are constructed with superpixels as nodes and the relationships between connected nodes as edges. The weight of an edge indicates how closely related the corresponding two connected nodes are, that is, the random walker will transfer from node $i$ to $j$ with a high probability if the weight $w_{ij}$ of the directed edge $e_{ij}$ is large.
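One common way to turn edge weights into transition probabilities is to divide each weight by the sum of the outgoing weights of its source node. The text above does not state the normalization, so the row normalization below is an assumption made for illustration, as are the function name `transition_matrix` and the example weights. A zero weight stands for a missing edge.

```python
def transition_matrix(weights):
    """Row-normalize non-negative edge weights into transition probabilities.

    weights[i][j] is the weight of the directed edge from node i to node j
    (0 if there is no such edge). Row normalization is an assumed choice.
    """
    rows = []
    for row in weights:
        total = sum(row)
        if total <= 0:
            raise ValueError("every node needs at least one outgoing edge")
        # A larger weight w_ij gives a larger probability of moving i -> j.
        rows.append([w / total for w in row])
    return rows


# Three nodes; node 0 is more strongly related to node 1 than to node 2.
W = [[0, 3, 1],
     [2, 0, 2],
     [1, 1, 0]]
P = transition_matrix(W)
```

With these weights the first row of `P` is `[0.0, 0.75, 0.25]`, so a walker at node 0 moves to node 1 three times as often as to node 2, and every row of `P` sums to one.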

1) Graph Model Based on Ergodic Markov Chain: As shown in Fig. 3, the nodes of $G^e(V^e, E^e)$ are composed of $m$ template nodes denoted by $T = [T_1, \ldots, T_m]$ and $n$ candidate nodes denoted by $C = [C_1, \ldots, C_n]$, where $T_i$ and $C_j$ are the mean CIE Lab color values of the $i$th template node and the $j$th candidate node, respectively.² The template nodes represent superpixels in the target area of the first frame, while candidate nodes represent superpixels in the search region of the current frame. Each template node is connected to all the other nodes in the graph, including other template nodes and all the candidate nodes. Each candidate node is connected to its neighboring candidate nodes and all the template nodes. The weight of the

² We also use $T_i$ and $C_j$ to denote the $i$th template node and the $j$th candidate node in this paper.
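The connectivity described above for the ergodic-chain graph can be sketched as a set of directed edges. This is an illustrative sketch rather than the implementation used in this paper. It assumes that "connected to" means an outgoing directed edge, that candidate adjacency is symmetric, and that the list of neighboring candidate pairs (for example, superpixels sharing a border) is supplied by the caller. The function name `build_ergodic_graph_edges` and the node numbering are hypothetical.

```python
def build_ergodic_graph_edges(m, n, candidate_neighbors):
    """Return the directed edges (source, target) of the graph.

    Nodes 0 .. m-1 are template nodes and nodes m .. m+n-1 are candidate
    nodes. candidate_neighbors holds pairs (a, b) of candidate indices in
    0 .. n-1 that are spatial neighbors (an assumed input).
    """
    edges = set()
    total = m + n
    # Each template node is connected to every other node in the graph.
    for t in range(m):
        for other in range(total):
            if other != t:
                edges.add((t, other))
    # Each candidate node is connected to all the template nodes ...
    for c in range(m, total):
        for t in range(m):
            edges.add((c, t))
    # ... and to its neighboring candidate nodes, in both directions.
    for a, b in candidate_neighbors:
        if a != b:
            edges.add((m + a, m + b))
            edges.add((m + b, m + a))
    return edges


# Two template nodes and three candidate nodes arranged in a row.
edges = build_ergodic_graph_edges(2, 3, [(0, 1), (1, 2)])
```

In this small example the graph has 18 directed edges: 8 leaving the template nodes, 6 from candidates to templates, and 4 between neighboring candidates. Candidates 0 and 2 (nodes 2 and 4) are not neighbors, so no edge joins them directly, although each can still reach the other through a template node.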
