IEEE Conference on Computer Communications Workshops

Delay-Optimal Traffic Engineering through Multi-agent Reinforcement Learning



Abstract

Traffic engineering (TE) is one of the most important methods of optimizing network performance: it designs forwarding and routing rules that meet the quality-of-service (QoS) requirements of a large volume of traffic flows. End-to-end (E2E) delay is one of the key TE metrics. Optimizing E2E delay, however, is very challenging in large-scale multihop networks because of profound network uncertainties and dynamics. This paper proposes a model-free TE framework that adopts multi-agent reinforcement learning for distributed control to minimize E2E delay. In particular, distributed TE is formulated as a multi-agent extension of the Markov decision process (MA-MDP). To solve this problem, a modular and composable learning framework is proposed, consisting of three interleaved modules: policy evaluation, policy improvement, and policy execution. Each module can be implemented with different algorithms and their extensions. Simulation results show that combining several extensions, such as double learning, expected policy evaluation, and on-policy learning, yields superior E2E delay performance under high traffic loads.
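The abstract names three reinforcement-learning ingredients (double learning, expected policy evaluation, and on-policy learning) without giving the algorithm itself. The following minimal Python sketch, which is not from the paper, illustrates one way those ingredients could be combined in a distributed next-hop-selection agent; the class name NodeAgent, the tabular state (the packet's destination), the epsilon-greedy policy, and all hyperparameter values are illustrative assumptions.

import random
from collections import defaultdict

class NodeAgent:
    """One agent per router; state = packet destination, action = next hop.

    Illustrative sketch only, not the authors' algorithm.
    """

    def __init__(self, neighbors, alpha=0.1, gamma=0.99, eps=0.1):
        self.neighbors = list(neighbors)          # candidate next hops
        self.alpha, self.gamma, self.eps = alpha, gamma, eps
        # Double learning: two independent Q-tables reduce the
        # overestimation bias of bootstrapping from a single table.
        self.q1 = defaultdict(float)
        self.q2 = defaultdict(float)

    def policy_probs(self, dest):
        # Epsilon-greedy distribution over next hops from the summed tables.
        qs = {a: self.q1[(dest, a)] + self.q2[(dest, a)] for a in self.neighbors}
        best = max(qs, key=qs.get)
        n = len(self.neighbors)
        return {a: self.eps / n + (1.0 - self.eps) * (a == best)
                for a in self.neighbors}

    def act(self, dest):
        # Sample a next hop from the behavior policy (policy execution).
        probs = self.policy_probs(dest)
        return random.choices(list(probs), weights=list(probs.values()))[0]

    def expected_value(self, dest):
        # Expected policy evaluation: the value of this node for a packet
        # headed to `dest`, averaged over this agent's own behavior policy,
        # i.e. on-policy in the sense that the bootstrap term uses the
        # policy actually being executed rather than a greedy max.
        probs = self.policy_probs(dest)
        return sum(p * 0.5 * (self.q1[(dest, a)] + self.q2[(dest, a)])
                   for a, p in probs.items())

    def update(self, dest, action, hop_delay, next_agent):
        # `next_agent` is the agent at the chosen next hop, or None once the
        # packet has arrived. The reward is the negative per-hop delay, so
        # each agent's return is minus the remaining end-to-end delay.
        target = -hop_delay
        if next_agent is not None:
            target += self.gamma * next_agent.expected_value(dest)
        # Double learning: randomly choose which table receives the update.
        q = self.q1 if random.random() < 0.5 else self.q2
        q[(dest, action)] += self.alpha * (target - q[(dest, action)])

In a full simulation, each router would run one such agent: when node i forwards a packet bound for destination d to neighbor j and measures a per-hop delay t, it would call agents[i].update(d, j, t, agents[j]) (or next_agent=None once the packet arrives). Because every agent maximizes the negative accumulated delay, greedy next-hop choices approximately minimize the E2E delay that the abstract targets.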
