An empirical study of potential-based reward shaping and advice in complex, multi-agent systems

Devlin S.; Kudenko D.; Grześ M.

Abstract

This paper investigates the impact of reward shaping in multi-agent reinforcement learning as a way to incorporate domain knowledge about good strategies. In theory, potential-based reward shaping does not alter the Nash Equilibria of a stochastic game, only the exploration of the shaped agent. We demonstrate empirically the performance of reward shaping in two problem domains within the context of RoboCup KeepAway by designing three reward shaping schemes, encouraging specific behaviour such as keeping a minimum distance from other players on the same team and taking on specific roles. The results illustrate that reward shaping with multiple, simultaneous learning agents can reduce the time needed to learn a suitable policy and can alter the final group performance.
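As a rough illustration of the technique the abstract names, potential-based reward shaping adds a term F(s, s') = γΦ(s') − Φ(s) to the environment reward, where Φ is a potential function over states. The sketch below is not one of the paper's three shaping schemes: the potential `potential`, the separation threshold `MIN_DIST`, the discount `GAMMA`, and the representation of a state as a list of teammate (x, y) positions are all assumptions chosen to mirror the "minimum distance from other players" idea.

```python
import math

GAMMA = 0.99    # discount factor (assumed value, not from the paper)
MIN_DIST = 5.0  # hypothetical minimum separation between teammates


def potential(positions):
    """Hypothetical potential Phi(s): number of teammate pairs that are
    at least MIN_DIST apart. `positions` is a list of (x, y) tuples."""
    total = 0.0
    for i in range(len(positions)):
        for j in range(i + 1, len(positions)):
            if math.dist(positions[i], positions[j]) >= MIN_DIST:
                total += 1.0
    return total


def shaped_reward(reward, state, next_state, gamma=GAMMA):
    """Return r + F(s, s') with F(s, s') = gamma * Phi(s') - Phi(s)."""
    return reward + gamma * potential(next_state) - potential(state)


# Two teammates start close together and then spread out.
close = [(0.0, 0.0), (1.0, 0.0)]
apart = [(0.0, 0.0), (6.0, 0.0)]
bonus_for_spreading = shaped_reward(1.0, close, apart)
```

Because the shaping term is a difference of potentials, the bonus earned by moving apart is paid back when the agents move together again (exactly so when γ = 1), which is the intuition behind shaping changing exploration rather than the equilibria.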


Bibliographic details

Source: Advances in Complex Systems, 2011, Issue 2, 28 pages
Authors: Devlin S.; Kudenko D.; Grześ M.
Original format: PDF
Language: English
Chinese Library Classification: Automation system theory
Keywords: multi-agent; reinforcement learning; reward shaping

