x
kþ1ðÞ
¼ cM
T
x
kðÞ
þ 1−cðÞv: (3)
To facilitate the following discussion, transform Equation 3 to
x
kþ1ðÞ
¼ Ax
kðÞ
þ f; (4)
where A = cM
T
and f =(1− c)v.
If we set x
(0)
= 0, the PageRank vector can be expressed as Neumann series:
x ¼ ∑
∞
i¼0
A
i
f; (5)
where A
1
¼ max
1≤ j≤n
∑
n
i¼1
a
ij
, A
i
¼ ∑
n
j¼1
a
ij
, A
∞
= max
1 ≤ i ≤ n
A
i
, f = max
1 ≤ i ≤ n
f
i
, and ρ(A) denote the spectral radius of matrix A. Because ρ(A)=c <1,
it is well‐known that Neumann series ∑
∞
i¼0
A
i
f is convergent.
27
If only the mth component of x in Equation 5 is considered, we have
x
m
¼ f
m
þ ∑
n
i
1
¼1
a
mi
1
f
i
1
þ ∑
n
i
1
;i
2
¼1
a
mi
1
a
i
1
i
2
f
i
2
þ … þ ∑
n
i
1
;i
2
; ::;i
k
¼1
a
mi
1
a
i
1
i
2
…a
i
k−1
i
k
f
i
k
þ …; (6)
where a
ij
∈ A and f
i
∈ f. Therefore, a selected vertex can independently compute the PageRank value by Equation 6, which is the mathematical form
of local PageRank problem.
2.2
|
Markov chain Monte Carlo method for local PageRank computations
In this subsection, we introduce how to compute the local PageRank problem by MCMC method, ie, obtain the component x
m
of Equation 6 by
random walks. We first assume that a
mj
= p
mj
b
mj
and p
mj
∈ P and b
mj
∈ B, where P is a Markov matrix and B is the adjoin matrix of P. In addition,
the entries of Markov matrix and adjoin matrix should satisfy Equation 7:
p
mj
>0 ; a
mj
≠0
p
mj
≥0 ; a
mj
¼ 0
8
<
:
and
b
mj
¼
a
mj
p
mj
; a
mj
≠0
b
mj
¼ 0 ; a
mj
¼ 0
8
<
:
; (7)
Therefore, Equation 6 can be represented as
x
m
¼ f
m
þ ∑
∞
k¼0
∑
n
i
1
¼1
∑
n
i
2
¼1
…∑
n
i
k
¼1
b
mi
1
b
i
1
i
2
…b
i
k−1
i
k
p
mi
1
p
i
1
i
2
…p
i
k−1
i
k
f
i
k
: (8)
Consider the problem of evaluating the inner product of a given vector g with the vector solution of Equation 5
g; xðÞ¼∑
n
α¼1
g
α
x
a
; (9)
where g ∈ R
n
. If we use MCMC method to calculate PageRank values, we should construct a random process way to compute a result as the desired
inner product. Assume a random walk S as follows
s
0
→s
1
→…→s
k
→…; (10)
where s
j
∈ {1, 2, … , n}, for j = 1,2,…. The transition probabilities of state P(s
0
= α)=p
α
and P(s
k
= β| s
k − 1
= α)=p
αβ
are defined as follows
p
αβ
≥0
∑
n
β¼1
p
αβ
¼ 1
(
and
p
αβ
≥0
∑
n
β¼1
p
αβ
¼ 1
(
(11)
Clearly, the entries p
αβ
of Equation 11 can satisfy Equation 7 as well.
We now define random variables W
k
and θ
*
as follows
W
k
¼ W
k−1
b
s
k−1
s
k
and θ
¼ ∑
∞
k
W
k
f
s
k
; where W
0
¼ 1: (12)
Therefore, we have the expectation of θ
*
,
Eθ
¼ f
s
0
þ ∑
∞
k¼0
∑
n
s
1
¼1
∑
n
s
2
¼1
…∑
n
s
k
¼1
b
s
0
s
1
b
s
1
s
2
…b
i
k−1
i
k
p
s
0
s
1
p
s
1
s
2
…p
s
k−1
s
k
f
s
k
: (13)
Thus, Eθ
*
= x
m
, when we set g as g
m
= 1 and g
i
=0, ∀ i ≠ m. Each calculation of expectation of θ
*
can be considered as a random walk.
Thus, local PageRank problem can be solved by MCMC method. The MCMC method simulates possible pathways for a single vertex as the
vertex is the termination of forward surfing. We can construct variant transition matrixes to approximate the PageRank scores, where the transition
matrices satisfy Equations 7 and 11. In this paper, we adopt the transition probability depending on the entries weighted
LAI ET AL. 3of15