IASTED International Conference on Artificial Intelligence and Soft Computing

COOPERATIVE REINFORCEMENT LEARNING USING AN EXPERT-MEASURING WEIGHTED STRATEGY WITH WOLF

Abstract

Gradient descent learning algorithms have proven effective in solving mixed-strategy games. The policy hill climbing (PHC) variants of WoLF (Win or Learn Fast) and PDWoLF (Policy Dynamics based WoLF) have both shown rapid convergence to equilibrium solutions by increasing the accuracy of their gradient parameters over standard Q-learning. Likewise, cooperative learning techniques using weighted strategy sharing (WSS) and expertness measurements improve agent performance when multiple agents are solving a common goal. By combining these cooperative techniques with fast gradient descent learning, an agent's performance converges to a solution at an even faster rate. This claim is verified in a stochastic grid-world environment using a limited-visibility hunter-prey model with both random and intelligent prey. Across five different expertness measurements, cooperative learning using each PHC algorithm converges faster than independent learning when agents learn strictly from better-performing agents.
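The two ingredients the abstract combines, a WoLF-PHC gradient learner and weighted strategy sharing driven by an expertness measure, can be sketched roughly as follows. This is a hypothetical illustration, not the paper's implementation: the class and function names, the use of accumulated reward as the expertness measure (one of the five the abstract mentions), and all parameter values are assumptions.

```python
import random
from collections import defaultdict


class WoLFPHCAgent:
    """Sketch of a WoLF-PHC learner (illustrative, not the paper's code)."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9,
                 delta_win=0.01, delta_lose=0.04):
        self.n = n_actions
        self.alpha, self.gamma = alpha, gamma
        self.delta_win, self.delta_lose = delta_win, delta_lose
        self.Q = defaultdict(lambda: [0.0] * n_actions)
        self.pi = defaultdict(lambda: [1.0 / n_actions] * n_actions)
        self.avg_pi = defaultdict(lambda: [1.0 / n_actions] * n_actions)
        self.counts = defaultdict(int)
        self.total_reward = 0.0   # assumed expertness measure: accumulated reward

    def act(self, state):
        # Sample an action from the mixed strategy pi(state, .).
        r, acc = random.random(), 0.0
        for a, p in enumerate(self.pi[state]):
            acc += p
            if r <= acc:
                return a
        return self.n - 1

    def update(self, s, a, r, s_next):
        # Standard Q-learning backup.
        self.Q[s][a] += self.alpha * (
            r + self.gamma * max(self.Q[s_next]) - self.Q[s][a])
        self.total_reward += r
        # Incrementally track the average policy pi-bar.
        self.counts[s] += 1
        c = self.counts[s]
        for i in range(self.n):
            self.avg_pi[s][i] += (self.pi[s][i] - self.avg_pi[s][i]) / c
        # WoLF criterion: small step when winning, large step when losing,
        # judged by expected value of pi versus pi-bar under current Q.
        winning = (sum(p * q for p, q in zip(self.pi[s], self.Q[s]))
                   > sum(p * q for p, q in zip(self.avg_pi[s], self.Q[s])))
        delta = self.delta_win if winning else self.delta_lose
        # Hill-climb toward the greedy action, then re-project onto the simplex.
        best = max(range(self.n), key=lambda i: self.Q[s][i])
        for i in range(self.n):
            step = delta if i == best else -delta / (self.n - 1)
            self.pi[s][i] = min(1.0, max(0.0, self.pi[s][i] + step))
        z = sum(self.pi[s])
        self.pi[s] = [p / z for p in self.pi[s]]


def share_weighted_strategies(agents):
    """Weighted strategy sharing: each agent replaces its Q-table with a
    blend of all agents' tables, weighted by relative expertness."""
    expertness = [max(ag.total_reward, 0.0) for ag in agents]
    total = sum(expertness) or 1.0
    w = [e / total for e in expertness]
    states = set().union(*(ag.Q.keys() for ag in agents))
    blended = {s: [sum(w[j] * agents[j].Q[s][i] for j in range(len(agents)))
                   for i in range(agents[0].n)]
               for s in states}
    for ag in agents:
        for s, qs in blended.items():
            ag.Q[s] = list(qs)
```

Under WSS the sharing step would run periodically between learning episodes, so that less expert agents pull their value estimates toward those of better-performing teammates while the WoLF-PHC updates continue in between.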
