Research on Enhanced Routing for Reinforcement Learning in Wireless Sensor Networks

doi: 10.3969/j.issn.0255-8297.2024.01.007

Journal of Applied Sciences, 2024, Vol. 42, Issue (1): 83-93.

ZHANG Huanan¹, LI Shijun², JIN Hong³

1. School of Data Science and Computer, Guangdong Peizheng College, Guangzhou 510830, Guangdong, China;
2. School of Computer, Wuhan University, Wuhan 430072, Hubei, China;
3. School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, Hubei, China

Received: 2023-06-30 Online: 2024-01-30 Published: 2024-02-02
Corresponding author: ZHANG Huanan, professor; research interests: sensors and artificial intelligence. E-mail: 2602502@peizheng.edu.cn

Abstract

Abstract: This study addresses the classical problem of finding the optimal parent node in tree-based routing for wireless networks. Several metrics that affect the routing decision rule are analyzed, including the weighted average of received signal strength, buffer occupancy, and power consumption ratio. A system model is proposed for applying a reinforcement-learning-enhanced tree routing protocol and its reinforcement learning algorithm in wireless sensor networks. The basic operation of the proposed tree-based routing protocol is described in detail, and the parent-node algorithm is updated to include loop detection. To make adaptive decisions in complex scenarios, a state space, an action set, and a reward function are defined, and the parent node with the highest reward is found through trial and error. Simulations and comparative studies verify that the parent node selection scheme achieves a reasonable tradeoff among the performance indicators of end-to-end delay, reliability, and energy consumption.

Key words: wireless sensor network, tree-based routing, reinforcement learning, multiple objectives
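
The abstract names the ingredients of the scheme (three link metrics, a reward function, trial-and-error parent selection, and loop detection) but gives no formulas. The following Python sketch is only an illustration of how such pieces could fit together. Everything in it is an assumption rather than the authors' method: the weights, the linear reward, the normalization of metrics to [0, 1], and the names `reward`, `creates_loop`, and `ParentSelector` are all hypothetical. For brevity it uses a stateless, bandit-style value update with epsilon-greedy exploration in place of the paper's state space, which the abstract does not specify.

```python
import random

# Hypothetical weights; the abstract does not state the actual reward definition.
WEIGHTS = {"rssi": 0.4, "buffer": 0.3, "power": 0.3}


def reward(rssi_avg, buffer_occupancy, power_ratio, weights=WEIGHTS):
    """Illustrative reward; all inputs are assumed normalized to [0, 1].

    Stronger signal is better; higher buffer occupancy and higher
    power consumption ratio are worse.
    """
    return (weights["rssi"] * rssi_avg
            + weights["buffer"] * (1.0 - buffer_occupancy)
            + weights["power"] * (1.0 - power_ratio))


def creates_loop(node, candidate, parent_of):
    """Return True if making `candidate` the parent of `node` would form a cycle.

    `parent_of` maps each node to its current parent (the sink has no entry).
    """
    seen = set()
    current = candidate
    while current is not None and current not in seen:
        if current == node:
            return True
        seen.add(current)
        current = parent_of.get(current)
    return False


class ParentSelector:
    """Epsilon-greedy trial-and-error choice among candidate parents."""

    def __init__(self, candidates, alpha=0.1, epsilon=0.1, rng=None):
        self.q = {c: 0.0 for c in candidates}  # estimated value per parent
        self.alpha = alpha                      # learning rate
        self.epsilon = epsilon                  # exploration probability
        self.rng = rng or random.Random()

    def choose(self, allowed=None):
        """Pick a parent, optionally restricted to loop-free candidates."""
        options = [c for c in self.q if allowed is None or c in allowed]
        if not options:
            return None
        if self.rng.random() < self.epsilon:
            return self.rng.choice(options)      # explore
        return max(options, key=self.q.get)      # exploit

    def update(self, parent, r):
        """Move the estimate for `parent` toward the observed reward `r`."""
        self.q[parent] += self.alpha * (r - self.q[parent])

    def best(self):
        """Return the candidate with the highest estimated value."""
        return max(self.q, key=self.q.get)
```

In use, a node would filter its neighbors with `creates_loop`, call `choose` on the remaining candidates, forward traffic through the chosen parent, then compute `reward` from the observed metrics and pass it to `update`. A linear weighted sum is only one way to trade off delay, reliability, and energy; the weights would need tuning for any real deployment.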

CLC number: TP393

ZHANG Huanan, LI Shijun, JIN Hong. Research on Enhanced Routing for Reinforcement Learning in Wireless Sensor Networks[J]. Journal of Applied Sciences, 2024, 42(1): 83-93.
