首页> 外文会议>International joint conference on artificial intelligence;IJCAI-97 >Ants and Reinforcement Learning: A Case Study in Routing in Dynamic Networks
【24h】

Ants and Reinforcement Learning: A Case Study in Routing in Dynamic Networks

机译:蚂蚁与强化学习:动态网络路由中的案例研究

获取原文

摘要

We investigate two new distributed routing algorithms for data networks based on simple biological "ants" that explore the network and rapidly learn good routes, using a novel variation of reinforcement learning. Tehse two algorithms are fully adaptive to topology changes and changes in link costs in the network, and have space and computational overheads that are competitive with traditional packet routing algorithms: although they can generate more routing traffic when the rate of failures in a network is low, they perform much better under higher fialrue rates. Both algorithms are more resilient than traditional algorithms, in the sense that random corruption of routing state has limited impact on the computation of paths. We present convergence theorems for both of our algorithms drawing on the theory of non-stationary and stationary discrete-time Markov chains over the reals. We present an extensive empirical evalaution of our algorithms on a simulator that is widely used in the computer networks community for validating and testing protocols. We present comparative results on data delivery performance, aggregate routing traffic (algorithm overhead), as well as the degree of resilience for our new algorithms and two traditional routing algorithms in current use. We also show that the performance our algorithms scale well with increase in network size using a realistic topology.
机译:我们研究了基于简单生物“蚂蚁”的两种新的数据网络分布式路由算法,这些算法探索网络并使用强化学习的新变化快速学习良好的路由。这两种算法完全适应拓扑变化和网络中链路成本的变化,并且具有空间和计算开销,与传统的分组路由算法相比具有竞争优势:尽管当网络的故障率较低时,它们可以产生更多的路由流量,在较高的fialrue率下它们的性能要好得多。从路由状态的随机破坏对路径计算的影响有限的意义上来说,这两种算法都比传统算法更具弹性。我们基于实数上的非平稳和固定离散时间马尔可夫链的理论,给出了两种算法的收敛定理。我们在一个广泛用于计算机网络社区中用于验证和测试协议的模拟器上,对算法进行了广泛的经验验证。我们提供了关于数据传递性能,聚合路由流量(算法开销)以及我们的新算法和当前使用的两种传统路由算法的弹性程度的比较结果。我们还表明,使用实际拓扑,随着网络规模的增加,我们算法的性能可以很好地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号