马尔可夫决策过程在异构无线网络垂直切换中的应用

86 浏览量更新于2024-07-14 1 收藏 1.28MB PDF 举报

"异构无线网络中基于马尔可夫的垂直切换决策算法" 这篇研究论文探讨了在异构无线网络环境下如何优化移动终端（MT）的垂直切换策略。垂直切换是指在不同类型的无线网络之间（如4G、5G、Wi-Fi等）进行的切换，以维持或提高服务质量（QoS）。传统的切换决策通常基于单一属性，例如接收信号强度（RSS），而本文则提出了一个更为全面的方法。首先，作者提出了一种基于RSS的单属性切换决策算法。RSS是评估无线连接质量的关键指标，它反映了接收到的信号强度，直接影响着通信的稳定性和数据传输速率。这个算法旨在根据当前RSS值来决定是否进行网络切换。接着，论文深入研究了基于连接寿命的切换决策模型。连接寿命是衡量网络稳定性的一个重要因素，长时间的连接有助于提升用户体验。通过考虑连接寿命，MT能够在首选网络中保持更长时间，减少不必要的频繁切换，从而改善整体的网络性能。然而，不同的MT可能对QoS有不同的需求。因此，作者将垂直切换决策问题转化为一个马尔可夫决策过程（MDP）。MDP是一种处理随机决策问题的数学框架，适用于描述具有不确定性的动态系统。在这个过程中，目标是最大化预期的总回报（即服务质量）同时最小化平均切换次数，以平衡QoS和网络稳定性。为了实现这一目标，论文构建了一个奖励函数，它能够评估每个连接期间的QoS。然后，应用G1（一种MDP求解算法）和熵方法进行迭代计算，这有助于找出一个固定且确定的切换决策策略。这两种方法的结合使得在考虑多种因素的同时，能够动态适应MT的QoS需求变化。数值模拟结果显示，提出的方案相比于现有的切换决策算法具有显著优势，不仅提高了QoS，还减少了平均切换次数，表明了该方法在实际无线网络环境中的潜力和实用性。总结来说，这篇研究论文提出了一个基于马尔可夫模型的智能垂直切换决策算法，考虑了多属性和动态环境的影响，旨在优化异构无线网络中的服务质量和网络效率。这种方法对于未来无线网络的设计和管理具有重要的理论与实践意义。

(1) P

[t]: probability that the MT connects with the WLAN at time instant t.

(2) P

[t]: probability that the MT connects with the 3G network at time instant t.

(3) P

W|G

[t]: probability that the MT connects with the WLAN at time instant t given that it is associated with the 3G net-

work at time instant t-1.

(4) P

G|W

[t]: probability that the MT connects with the 3G network at time instant t given that it is associated with the

WLAN at time instant t-1.

We assume the MT is in the WLAN initially without loss of generality, therefore P

[0] = 1 and P

[0]=0.P

[t] and P

[t] can

be calculated by the following formulations,

½t þ 1¼P

WjG

½t þ 1P½tþð1  P

GjW

½t þ 1ÞP

½tð6Þ

½t þ 1¼P

GjW

½t þ 1P½tþð1  P

WjG

½t þ 1ÞP

½tð7Þ

Conditional probabilities P

W|G

[t + 1] and P

G|W

[t + 1] depend on the decision method. Similar with [22], we also deﬁne these

probabilities as:

GjW

½t þ 1¼PrðLT½t þ 1Þ < D

jW½t ; RSS½t þ 1 < T

Þð8Þ

where W[t] represents the situation that the MT connects with WLAN, and T

is a predeﬁned threshold of MO. The second

condition in Eq. (8) is important to the MT with low velocity. We can work out the transition probability by,

GjW

½t þ 1¼

1

Z½tþ1Z½t

ðZ

; Z

ÞdZ



½t



ð9Þ

where Z½ t¼RSS½tD

R½t, and

[t] and

[t] are the expectation and standard deviation of Z[t]. Q(x) is the complementary

error function, and P

W|C

[k + 1] can be worked out by,

WjG

½t þ 1¼

PrððRSS½t þ 1Þ > T

jRSS½t < T

Prð

RSS½t  < T

ð10Þ

where T

is a predeﬁned threshold of MI. The transition probabilities based calculation of the handoff probabilities are sim-

ilar to the methods adopted in [25].

The number of handoffs, presented by N

, has much impact on the signaling ﬂow, and it is the total number of MI and

MO. Therefore, N

is determined by the instantaneous probability of MI and MO, and can be worked out by:

½t þ 1¼P

GjW

½t þ 1P

½t ð11Þ

½t þ 1¼P

WjG

½t þ 1P

ð12Þ

The expectation of N

is,

EfN

g¼EfN

gþEfN

g¼

max

t¼1

ðP

½tþP

½tÞ ð13Þ

where t

max

is the time instant when the MT arrives at the edge of the WLAN, and it is determined by the MT’s velocity and the

coverage of the WLAN. N

and N

are the expected numbers of MOs and MIs, respectively. The ﬂow chart of MSA-VHO can

be seen in Fig. 2.

3. MDP-based multi-attribute handoff decision algorithm

Because the transition probability between different system states has no relationship with the past state, and the deci-

sion is dependent on the combination of multiple parameters, our considered network model matches well with the Markov

process. Fig. 3 illustrates the network system where more than one WLAN network exists.

The formulated Markov process based vertical handoff decision model includes ﬁve elements: decision epoch, state, ac-

tion, transition probability and reward. During each decision epoch, the MT decides whether to remain in the current net-

work or switch to other networks.

3.1. Markov based handoff decision model

We consider the MT chooses an action a based on its current state information. The state space is denoted by s. For each

state s

S, the state information includes the network identiﬁcation number indicating which network the MT currently con-

nects with, the available bandwidth and the average delay of each candidate network. X

denotes the state at decision epoch

Z. Ning et al. / Computers and Electrical Engineering 40 (2014) 456–472

459

剩余16页未读，继续阅读

weixin_38706045

粉丝: 4
资源: 950

马尔可夫决策过程在异构无线网络垂直切换中的应用

联合无线资源管理技术的改进型垂直切换算法

基于移动预测的垂直切换算法

车辆异构网络中基于马尔可夫过程的垂直切换优化算法。

超密集异构无线网络中基于位置预测的高效切换算法研究

无线传输中基于马尔可夫决策的高能效策略

马尔可夫决策法优化异构无线网络垂直切换策略

马尔可夫过程优化的车辆异构网络垂直切换算法提升QoS

论文研究-基于最优功率控制和马尔可夫链优化的异构无线网络.pdf

基于DA优化模糊神经网络的异构无线网络接入选择算法.docx

大规模马尔可夫决策过程的算法

最新资源