Reinforcement Learning for Adaptive Theory of Mind in the Sigma Cognitive Architecture


Abstract

One of the most common applications of human intelligence is social interaction, where people must make effective decisions despite uncertainty about the potential behavior of others around them. Reinforcement learning (RL) provides one method for agents to acquire knowledge about such interactions. We investigate different methods of multiagent reinforcement learning within the Sigma cognitive architecture. We leverage Sigma's architectural mechanism for gradient descent to realize four different approaches to multiagent learning: (1) with no explicit model of the other agent, (2) with a model of the other agent as following an unknown stationary policy, (3) with prior knowledge of the other agent's possible reward functions, and (4) through inverse reinforcement learning (IRL) of the other agent's reward function. While the first three variations re-create existing approaches from the literature, the fourth represents a novel combination of RL and IRL for social decision-making. We show how all four styles of adaptive Theory of Mind are realized through Sigma's same gradient descent algorithm, and we illustrate their behavior within an abstract negotiation task.
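As a rough illustration of approach (2) above, modeling the other agent as following an unknown stationary policy, the sketch below keeps a probability estimate of the other agent's actions, nudges it toward each observed action with a gradient-descent step on squared error, and then best-responds to that estimate. This is plain Python, not Sigma code, and it does not reproduce the paper's mechanism or negotiation task. The function names, the two-action payoff table, and the learning rate are all hypothetical choices made for the example.

```python
# Illustrative sketch only; not Sigma code. All names and payoffs are hypothetical.

def update_policy_estimate(estimate, observed_action, learning_rate=0.1):
    """One gradient-descent step on 0.5 * ||estimate - one_hot(observed_action)||^2.

    The gradient with respect to each entry is (p - target), so the step moves
    every probability toward the one-hot target and keeps the entries summing to 1.
    """
    return [
        p - learning_rate * (p - (1.0 if a == observed_action else 0.0))
        for a, p in enumerate(estimate)
    ]


def best_response(payoff, estimate):
    """Pick our action with the highest expected payoff under the estimated policy."""
    expected = [sum(q * r for q, r in zip(estimate, row)) for row in payoff]
    return max(range(len(expected)), key=expected.__getitem__)


# PAYOFF[my_action][other_action]; the values are invented for illustration.
PAYOFF = [
    [3.0, 0.0],  # action 0: a tough demand, good only if the other agent concedes
    [2.0, 1.0],  # action 1: a moderate offer, acceptable either way
]

# The other agent is assumed to play action 1 (no concession) every round.
estimate = [0.5, 0.5]
for _ in range(50):
    estimate = update_policy_estimate(estimate, observed_action=1)

action = best_response(PAYOFF, estimate)
```

With a stationary opponent the estimate converges toward the opponent's true action frequencies, so the best response settles on the moderate offer here. Approaches (3) and (4) would replace the direct policy estimate with a model of the other agent's reward function, which this sketch does not attempt.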


Bibliographic record

Source: International Conference on Artificial General Intelligence, 2014, pp. 143-154 (12 pages)
Authors: David V. Pynadath; Paul S. Rosenbloom; Stacy C. Marsella
Original format: PDF

Similar literature

1. Analysis and solution of a predator-protector-prey multi-robot system by a high-level reinforcement learning architecture and the adaptive systems theory [J]. Jose Antonio Martin H., Javier de Lope, Dario Maravall. Robotics and Autonomous Systems, 2010, No. 12.
2. A learning architecture based on reinforcement learning for adaptive control of the walking machine LAURON [J]. Winfried Ilg, Karsten Berns. Robotics and Autonomous Systems, 1995, No. 4.
3. Design and Development of Hybrid Architecture Model Named Enhanced Mind Cognitive Architecture of pupils for Implementing the Learning Concepts in Society of Agents [J]. D. Ganesha, Vijayakumar Maragal Venkatamuni. Indian Journal of Science and Technology, 2017, No. 1.
4. "Re:ROS": Prototyping of Reinforcement Learning Environment for Asynchronous Cognitive Architecture [C]. Sei Ueno, Masahiko Osawa, Michita Imai. International Early Research Career Enhancement School on Biologically Inspired Cognitive Architectures and Cybersecurity, 2018.
5. Hybrid learning approach based on adaptive resonance theory and reinforcement learning for computer generated agents [D]. Ninomiya, Susumu. 2002.
6. The Relationship Between Different Aspects of Theory of Mind and Symptom Clusters in Psychotic Disorders: Deconstructing Theory of Mind Into Cognitive Affective and Hyper Theory of Mind [O]. Laura M.-L. Dorn, Nele Struck, Florian Bitsch. 2021.
7. Reinforcement Learning for Adaptive Theory of Mind in the Sigma Cognitive Architecture [O]. David V. Pynadath, Paul S. Rosenbloom, Stacy C. Marsella. 2015.
8. Adaptive Mesh Refinement for Efficient Exploration of Cognitive Architectures and Cognitive Models [R]. Best, B. J., Furjanic, C., Gerhart, N. 2009.
