JMLR: Workshop and Conference Proceedings

Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability


Abstract

Many real-world tasks involve multiple agents with partial observability and limited communication. Learning is challenging in these settings because each agent has only a local viewpoint of the world, which appears non-stationary due to concurrently-exploring teammates. Approaches that learn specialized policies for individual tasks face problems when applied to the real world: not only must agents learn and store a distinct policy for each task, but in practice task identities are often not observable, making these approaches inapplicable. This paper formalizes and addresses the problem of multi-task multi-agent reinforcement learning under partial observability. We introduce a decentralized single-task learning approach that is robust to concurrent interactions of teammates, and present an approach for distilling single-task policies into a unified policy that performs well across multiple related tasks, without explicit provision of task identity.
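
The decentralized single-task learner's robustness to concurrently-exploring teammates rests on asymmetric, optimistic value updates; in the full paper this takes the form of hysteretic learning combined with deep recurrent Q-networks. Below is a minimal tabular sketch of the hysteretic update that captures the core idea; the function name and hyperparameter values are illustrative assumptions, not the paper's settings:

```python
import numpy as np

def hysteretic_q_update(Q, s, a, r, s_next, alpha=0.1, beta=0.01, gamma=0.95):
    """One hysteretic Q-learning update with asymmetric learning rates.

    Positive TD errors are applied at the full rate alpha, negative ones
    at the much smaller rate beta, so an agent stays optimistic and does
    not unlearn good actions whose payoff was spoiled by a teammate's
    exploration.
    """
    td = r + gamma * np.max(Q[s_next]) - Q[s, a]  # temporal-difference error
    lr = alpha if td >= 0 else beta               # hysteresis: smaller rate for bad news
    Q[s, a] += lr * td
    return Q
```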
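The second contribution, distilling the specialized single-task policies into one task-agnostic policy, can be sketched as a supervised matching step. The snippet below assumes hypothetical per-task teacher networks and observation batches, and uses a temperature-scaled KL objective in the spirit of standard policy distillation; it is not the paper's exact loss:

```python
import torch
import torch.nn.functional as F

def distill_step(student, teachers, batches, optimizer, tau=0.01):
    """One gradient step distilling task-specific teachers into one student.

    The student never sees a task label: it only receives observations
    and is trained to match each teacher's softened action distribution.
    `teachers[i]` maps observations from task i to Q-values of shape
    [batch, n_actions]; all names and the temperature `tau` are
    illustrative assumptions.
    """
    optimizer.zero_grad()
    loss = torch.tensor(0.0)
    for teacher, obs in zip(teachers, batches):
        with torch.no_grad():
            target = F.softmax(teacher(obs) / tau, dim=-1)  # softened teacher policy
        log_p = F.log_softmax(student(obs) / tau, dim=-1)   # student log-policy
        loss = loss + F.kl_div(log_p, target, reduction="batchmean")
    loss.backward()
    optimizer.step()
    return float(loss)
```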
