This paper considers a distributed reinforcement learning problem in the presence of Byzantine agents. The system consists of a central coordinating authority, called the "master agent", and multiple computational entities, called "worker agents". The master agent is assumed to be reliable, while a small fraction of the workers can be Byzantine (malicious) adversaries. The objective is to cooperatively maximize a convex combination of the honest (non-malicious) worker agents' long-term returns through communication between the master agent and the worker agents. A distributed actor-critic algorithm that aggregates worker updates via an entry-wise trimmed mean is studied. The algorithm's communication efficiency is then improved by allowing each worker agent to send only a scalar-valued variable to the master agent at each iteration, instead of the entire parameter vector; the improved algorithm computes a trimmed mean over only the received scalar-valued variables. Both algorithms are shown to converge almost surely.
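To illustrate the aggregation rule named in the abstract, the following is a minimal sketch of an entry-wise trimmed mean: for each coordinate, the largest and smallest values across workers are discarded before averaging, which bounds the influence of Byzantine outliers. The function and parameter names (`trimmed_mean`, `num_byzantine`) are illustrative assumptions, not identifiers from the paper, and the sketch omits the surrounding actor-critic updates.

```python
import numpy as np

def trimmed_mean(updates, num_byzantine):
    """Entry-wise trimmed mean: for each coordinate, drop the num_byzantine
    largest and num_byzantine smallest values across workers, then average
    the remaining values. `updates` is (num_workers, dim)."""
    updates = np.sort(np.asarray(updates), axis=0)  # sort each coordinate across workers
    kept = updates[num_byzantine : updates.shape[0] - num_byzantine]
    return kept.mean(axis=0)

# Example: 6 workers, a 2-dimensional parameter, at most 1 Byzantine worker.
rng = np.random.default_rng(0)
honest = rng.normal(size=(5, 2))            # honest workers' parameter vectors
byzantine = np.array([[1e3, -1e3]])         # an adversarial outlier
print(trimmed_mean(np.vstack([honest, byzantine]), num_byzantine=1))
```

The same rule applied to the scalar-valued variables of the communication-efficient variant is the one-dimensional special case of this function.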