首页> 外文会议>International Conference on Autonomous Agents and Multiagent Systems >Learning Plans by Acquiring Grounded Linguistic Meanings from Corrections
【24h】

Learning Plans by Acquiring Grounded Linguistic Meanings from Corrections

机译:通过获取从更正的基础语言含义来学习计划

获取原文

摘要

We motivate and describe a novel task which is modelled on interactions between apprentices and expert teachers. In the task the agent must learn to build towers which are constrained by rules. Whenever the agent performs an action which violates a rule the teacher provides verbal corrective feedback (e.g. "No, put red blocks on blue blocks") and answers the learner's clarification questions. The agent must learn to build rule compliant towers from these corrections and the context in which they were given. The agent starts out unaware of the constraints as well as the domain concepts in which the constraints are expressed. Therefore an agent that takes advantage of the linguistic evidence must learn the denotations of neologisms and adapt its conceptualisation of the planning domain to incorporate those denotations. We show that an agent which does utilise linguistic evidence outperforms a strong baseline which does not.
机译:我们激励并描述了一种关于学徒和专家教师之间的互动建模的新任务。 在任务中,代理人必须学会建立受规则约束的塔楼。 每当代理执行违反规则的动作时,教师提供口头纠正反馈(例如,“不,蓝块上的红色块”)并回答学习者的澄清问题。 代理人必须学会从这些更正和所提供的上下文中建立规则兼容塔。 该代理开始不知道约束以及表示约束的域概念。 因此,利用语言证据的代理必须了解新闻界的表示,并根据规划域的概念化来纳入那些的表情。 我们展示了一种利用语言证据的代理商优于一个强大的基线,这些基线没有。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号