首页> 外文会议>International conference on artificial general intelligence >Reinforcement Learning for Adaptive Theory of Mind in the Sigma Cognitive Architecture
【24h】

Reinforcement Learning for Adaptive Theory of Mind in the Sigma Cognitive Architecture

机译:Sigma认知体系中适应性心理理论的强化学习

获取原文

摘要

One of the most common applications of human intelligence is social interaction, where people must make effective decisions despite uncertainty about the potential behavior of others around them. Reinforcement learning (RL) provides one method for agents to acquire knowledge about such interactions. We investigate different methods of multiagent reinforcement learning within the Sigma cognitive architecture. We leverage Sigma's architectural mechanism for gradient descent to realize four different approaches to multiagent learning: (1) with no explicit model of the other agent, (2) with a model of the other agent as following an unknown stationary policy, (3) with prior knowledge of the other agent's possible reward functions, and (4) through inverse reinforcement learning (IRL) of the other agent's reward function. While the first three variations re-create existing approaches from the literature, the fourth represents a novel combination of RL and IRL for social decision-making. We show how all four styles of adaptive Theory of Mind are realized through Sigma's same gradient descent algorithm, and we illustrate their behavior within an abstract negotiation task.
机译:人工智能是最常见的应用之一,它是社交互动,尽管人们不确定周围其他人的潜在行为,但他们仍必须做出有效的决策。强化学习(RL)为代理提供了一种获取有关此类交互的知识的方法。我们研究了在Sigma认知架构内进行多主体强化学习的不同方法。我们利用Sigma的梯度下降机制来实现四种不同的多主体学习方法:(1)没有其他主体的显式模型;(2)遵循未知平稳策略的其他主体模型;(3)另一个代理人可能的奖励功能的先验知识,以及(4)通过反向增强学习(IRL)来了解另一个代理人的奖励功能。前三种变体从文献中重建了现有方法,而第四种则代表了RL和IRL的新颖组合,可用于社会决策。我们展示了如何通过Sigma的相同梯度下降算法来实现自适应思维理论的所有四种样式,并说明了它们在抽象协商任务中的行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号