Ants and Reinforcement Learning: A Case Study in Routing in Dynamic Networks

机译：蚂蚁与强化学习：动态网络路由中的案例研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We investigate two new distributed routing algorithms for data networks based on simple biological "ants" that explore the network and rapidly learn good routes, using a novel variation of reinforcement learning. Tehse two algorithms are fully adaptive to topology changes and changes in link costs in the network, and have space and computational overheads that are competitive with traditional packet routing algorithms: although they can generate more routing traffic when the rate of failures in a network is low, they perform much better under higher fialrue rates. Both algorithms are more resilient than traditional algorithms, in the sense that random corruption of routing state has limited impact on the computation of paths. We present convergence theorems for both of our algorithms drawing on the theory of non-stationary and stationary discrete-time Markov chains over the reals. We present an extensive empirical evalaution of our algorithms on a simulator that is widely used in the computer networks community for validating and testing protocols. We present comparative results on data delivery performance, aggregate routing traffic (algorithm overhead), as well as the degree of resilience for our new algorithms and two traditional routing algorithms in current use. We also show that the performance our algorithms scale well with increase in network size using a realistic topology.

机译：我们研究了基于简单生物“蚂蚁”的两种新的数据网络分布式路由算法，这些算法探索网络并使用强化学习的新变化快速学习良好的路由。这两种算法完全适应拓扑变化和网络中链路成本的变化，并且具有空间和计算开销，与传统的分组路由算法相比具有竞争优势：尽管当网络的故障率较低时，它们可以产生更多的路由流量，在较高的fialrue率下它们的性能要好得多。从路由状态的随机破坏对路径计算的影响有限的意义上来说，这两种算法都比传统算法更具弹性。我们基于实数上的非平稳和固定离散时间马尔可夫链的理论，给出了两种算法的收敛定理。我们在一个广泛用于计算机网络社区中用于验证和测试协议的模拟器上，对算法进行了广泛的经验验证。我们提供了关于数据传递性能，聚合路由流量（算法开销）以及我们的新算法和当前使用的两种传统路由算法的弹性程度的比较结果。我们还表明，使用实际拓扑，随着网络规模的增加，我们算法的性能可以很好地扩展。

著录项

来源
《International joint conference on artificial intelligence;IJCAI-97》|1997年|p.832-838|共7页
会议地点
作者
Devika Subramanian; Peter Druschel; Johnny Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. A Hybrid Privacy Preserving Algorithm based on Ants and Reinforcement Learning for Distributed and Adaptive routing in TOR Networks [J] . S. Shakila, Gopinath Ganapathy Australian Journal of Basic and Applied Sciences . 2015,第2015期

机译：TOR网络中基于蚂蚁和强化学习的分布式自适应路由混合隐私保护算法。
2. A deep reinforcement learning-based multi-optimality routing scheme for dynamic IoT networks [J] . Cong P., Zhang Y., Wang W., Computer networks . 2021,第Juna19期

机译：基于深度加强基于学习的动态IOT网络的多功能路由方案
3. RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic loT networks [J] . Sharma Deepak Kumar, Rodrigues Joel J. P. C., Vashishth Vidushi, Wireless Networks . 2020,第6期

机译：Rlproproph：一种基于动态规划的机会批次网络最优路由的加固学习方法
4. Ants and Reinforcement Learning: A Case Study in Routing in Dynamic Networks [C] . Devika Subramanian, Peter Druschel, Johnny Chen International Joint Conference on Artificial Intelligence . 2007

机译：蚂蚁和强化学习：动态网络路由路由的案例研究
5. Multipath routing algorithms for communication networks: Ant routing and optimization based approaches. [D] . Purkayastha, Punyaslok. 2009

机译：通信网络的多路径路由算法：基于蚂蚁路由和优化的方法。
6. A Reinforcement Learning Routing Protocol for UAV Aided Public Safety Networks [O] . Hassan Ishtiaq Minhas, Rizwan Ahmad, Waqas Ahmed, 2021

机译：用于UAV辅助公共安全网络的加强学习路由协议
7. Route Selection for Multi-Hop Cognitive Radio Networks Using Reinforcement Learning: An Experimental Study [O] . Aqeel Raza Syed, Kok-Lim Alvin Yau, Junaid Qadir, 2016

机译：使用强化学习的多跳认知无线电网络的路线选择：实验研究

Ants and Reinforcement Learning: A Case Study in Routing in Dynamic Networks

摘要

著录项

相似文献

相关主题

期刊订阅