An Intelligent Routing Algorithm for Wireless Sensor Networks Driven by Reinforcement Learning


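This article discusses the paper "An Intelligent Routing Algorithm in Wireless Sensor Networks based on Reinforcement Learning". As Wireless Sensor Networks (WSNs) are deployed widely in applications such as environmental monitoring and military reconnaissance, extending network lifetime has remained a central concern. To improve WSN performance and operating time, the authors propose an algorithm named RLLO (Reinforcement Learning-based Lifetime Optimization).

The core idea of RLLO is to use reinforcement learning (RL) to choose packet forwarding paths intelligently, balancing energy consumption among nodes against communication efficiency. It builds two key factors, the residual energy of nodes and the hop count, into the design of the reward function. The aim is to spread energy consumption evenly so that no node is drained prematurely, which extends the lifetime of the network as a whole.

RLLO was compared with the conventional Energy-Aware Routing (EAR) algorithm and its improved version, Improved Energy-Aware Routing (I-EAR). The results show that RLLO achieves a significant improvement in both network lifetime and packet delivery success rate. By adjusting its routing policy dynamically, the algorithm reduces unnecessary communication overhead and lowers node energy consumption.

Notably, RLLO requires no additional cost: it saves energy by learning and optimizing network behavior. It adjusts itself in complex and changing environments, and its performance improves gradually as the network keeps running. This adaptivity is essential for extending the lifetime of WSNs. The paper offers a novel and practical approach to routing design in wireless sensor networks, shows the potential of reinforcement learning for this class of problem, and is relevant in both theory and practice to future energy management and network optimization in WSNs.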
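The summary above says only that the reward function combines residual energy and hop count; the exact formula is not given in this excerpt. The following is a minimal sketch of how such a scheme might look, assuming a standard tabular Q-learning update and epsilon-greedy next-hop selection. The function names, the weights `w_energy` and `w_hops`, and the parameters `alpha`, `gamma`, and `epsilon` are all hypothetical and are not taken from the paper.

```python
import random

def reward(residual_energy, initial_energy, hop_count, max_hops,
           w_energy=0.5, w_hops=0.5):
    """Hypothetical reward: larger for neighbors with more energy left
    and fewer hops to the sink. The weights are illustrative assumptions."""
    energy_term = residual_energy / initial_energy
    hop_term = 1.0 - hop_count / max_hops
    return w_energy * energy_term + w_hops * hop_term

def q_update(q, node, next_hop, r, next_q_max, alpha=0.5, gamma=0.9):
    """Standard Q-learning update for the (node, next_hop) pair."""
    old = q.get((node, next_hop), 0.0)
    q[(node, next_hop)] = old + alpha * (r + gamma * next_q_max - old)
    return q[(node, next_hop)]

def choose_next_hop(q, node, neighbors, epsilon=0.1, rng=random):
    """Epsilon-greedy choice: mostly the best-valued neighbor,
    occasionally a random one to keep exploring."""
    if rng.random() < epsilon:
        return rng.choice(neighbors)
    return max(neighbors, key=lambda n: q.get((node, n), 0.0))
```

Under these assumptions, a neighbor that is low on energy earns a smaller reward even when it lies on a short path, so the learned policy gradually steers traffic away from it.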

An Intelligent Routing Algorithm in Wireless Sensor Networks based on Reinforcement Learning

Wenjing Guo 1,a,*, Cairong Yan 2,b, Yanglan Gan 3,c and Ting Lu 4,d

1,2,3,4 School of Computer Science and Technology, Donghua University, No.2999, North Renmin Road, Songjiang District, Shanghai, China

wjguo@dhu.edu.cn, cryan@dhu.edu.cn, ylgan@dhu.edu.cn, luting@dhu.edu.cn

Keywords: Wireless sensor networks (WSNs); Network lifetime; Intelligent routing; Reinforcement learning (RL); Packet delivery.

Abstract. Lifetime enhancement has been a hot issue in Wireless Sensor Networks (WSNs). To prolong the network lifetime of WSNs, this paper proposes an intelligent routing algorithm named RLLO. RLLO makes use of the strengths of reinforcement learning (RL) and considers residual energy and hop count in defining the reward function. Its aim is to distribute energy consumption uniformly and to improve packet delivery without additional cost. The proposed algorithm has been compared with Energy Aware Routing (EAR) and improved EAR (I-EAR). Simulation results show that RLLO gains a significant improvement in network lifetime and packet delivery over these two algorithms.

Introduction

Because sensor nodes in Wireless Sensor Networks (WSNs) have a limited energy supply and constrained computation and communication ability [1,2], network lifetime becomes the major concern in WSNs. Therefore, the goal of a routing algorithm in WSNs is to prolong the network lifetime as far as possible.

To enhance network lifetime, many routing algorithms have been specially designed for WSNs. Among the proposed algorithms, Energy Aware Routing (EAR) [3] is one of the most typical data-centric routing algorithms. It is based on the idea that always using the minimum-energy path is inadvisable, because doing so depletes the energy of the nodes on that path and eventually partitions the network. EAR maintains multiple paths between the source node and the destination node, and occasionally chooses sub-optimal paths to balance energy consumption across the whole network. Its inherent advantage is that it balances energy consumption between nodes so as to postpone the time when the first sensor node dies, and it is shown in [3] to provide an overall energy saving of 21.5% and a 44% increase in network lifetime over Directed Diffusion. Moreover, the survey [4] finds that it has much stronger energy efficiency than the other data-centric routing algorithms in WSNs. Thus, we compare our proposed algorithm with EAR.
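The probabilistic multi-path choice described above can be sketched as follows. This is a minimal illustration, assuming each candidate path is selected with probability inversely proportional to its energy cost, so the cheapest path is used most often while costlier paths still carry some traffic. The path names and cost values are made up for the example, and the details of EAR's actual cost metric are not reproduced here.

```python
import random

def path_probabilities(costs):
    """Map each path's (positive) cost to a selection probability
    that is inversely proportional to that cost."""
    inverse = {path: 1.0 / cost for path, cost in costs.items()}
    total = sum(inverse.values())
    return {path: value / total for path, value in inverse.items()}

def pick_path(costs, rng=random):
    """Draw one path at random according to path_probabilities."""
    probs = path_probabilities(costs)
    paths = list(probs)
    weights = [probs[p] for p in paths]
    return rng.choices(paths, weights=weights, k=1)[0]

# Hypothetical costs for three paths between a source and a destination.
example_costs = {"A": 1.0, "B": 2.0, "C": 4.0}
example_probs = path_probabilities(example_costs)
```

With these example costs, path A is chosen most often, but B and C are still selected from time to time, which is what spreads the load away from the minimum-energy path.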

However, EAR has some problems. First, route selection considers only energy consumption and remaining energy; other metrics such as delay and packet delivery are not taken into account. Second, flooding occurs not only in the setup phase but also in the route maintenance phase, which adds considerable overhead. Third, the data communication phase depends entirely on a routing table that was established in advance, so it cannot fully reflect the current condition of the network. The authors of [5] propose a new algorithm, I-EAR, to improve the performance of EAR. In I-EAR, sensor nodes choose one route among several with a probability determined by the residual energy of nodes, the energy consumption of the communication, and the number of paths that include the forwarding node. I-EAR has been simulated and compared with EAR, and the results show that it outperforms EAR by postponing the first node death. However, it has the same problems as EAR.
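The text names three inputs to the I-EAR route probability but does not give the formula from [5]. The sketch below is therefore only an assumed form, chosen for illustration: a candidate's weight grows with its residual energy and shrinks with its communication cost and with the number of paths that pass through it. The function name and the weighting rule are hypothetical.

```python
def iear_probabilities(candidates):
    """candidates maps a forwarding node to a tuple
    (residual_energy, comm_cost, paths_through_node).
    The weight e / (c * k) is an assumed form, not the formula from [5]."""
    weights = {
        node: energy / (cost * n_paths)
        for node, (energy, cost, n_paths) in candidates.items()
    }
    total = sum(weights.values())
    return {node: w / total for node, w in weights.items()}

# Hypothetical neighbors: n1 has more energy, n3 already serves two paths.
example = iear_probabilities({
    "n1": (2.0, 1.0, 1),
    "n2": (1.0, 1.0, 1),
    "n3": (2.0, 1.0, 2),
})
```

In this example n1 receives half of the probability mass, while n2 and n3 each receive a quarter, because n3's higher energy is offset by the extra path it already carries.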

Applied Mechanics and Materials Vol. 678 (2014) pp 487-493. Submitted: 26.08.2014. Accepted: 26.08.2014. Online available since 2014/Oct/08 at www.scientific.net. doi:10.4028/www.scientific.net/AMM.678.487


