【24h】

Building an Adaptive Learning Agent in V-world

机译:在V-world中构建自适应学习代理

获取原文
获取原文并翻译 | 示例

摘要

This paper describes Learner, an agent that learns from its experiences. Initially Learner knows neither the effects of its actions nor the properties of the objects in its world. Furthermore, Learner does not have an explicit representation of its environment. Instead it uses the world as its own model. The agent is given restricted abilities to perceive and to move, and the rules of death and survival. It explores its world, and through trial-and-error learns its properties. As a result Learner can survive indefinitely in low-risk worlds and lives long enough to be called intelligent in the highest-risk worlds. Learner is a reinforcement learning agent: it does not have a teacher; instead it employs a condition-action component, where the action evaluation is the reinforcement. This is an LPA Prolog implementation developed in V-world, an agent-testing computer environment featuring various simulated worlds.
机译:本文介绍了学习者,该学习者可以从其经验中学习。最初,学习者既不了解其动作的效果,也不了解其世界中对象的属性。此外,学习者没有其环境的明确表示。相反,它使用世界作为自己的模型。特工被赋予有限的感知和移动能力,以及死亡和生存规则。它探索世界,并通过反复试验来了解其特性。结果,学习者可以在低风险的世界中无限期生存,并且寿命足够长,可以在高风险的世界中被称为聪明人。学习者是强化学习的媒介:它没有老师。相反,它使用条件-动作组件,其中动作评估是增强。这是在V-world中开发的LPA Prolog实施,V-world是具有各种模拟世界的代理测试计算机环境。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号