首页> 外国专利> SYSTEM AND METHOD FOR UNCERTAINTY-BASED ADVICE FOR DEEP REINFORCEMENT LEARNING AGENTS

SYSTEM AND METHOD FOR UNCERTAINTY-BASED ADVICE FOR DEEP REINFORCEMENT LEARNING AGENTS

机译:基于不确定性的深度加强学习代理建议的系统和方法

摘要

Disclosed are systems, methods, and devices for training a learning agent. A learning agent that maintains a reinforcement learning neural network is instantiated. State data reflective of a state of an environment explored by the learning agent is received. An uncertainty metric calculated upon processing the state data, the uncertainty metric measuring epistemic uncertainty of the learning agent. Upon determining that the uncertainty metric exceeds a pre-defined threshold: a request signal requesting an action suggestion from a demonstrator is sent; a suggestion signal reflective of the action suggestion is received; and an action signal to implement the action suggestion is sent.
机译:公开了用于训练学习代理的系统,方法和设备。实例化了维护加强学习神经网络的学习代理。收到了学习代理探索的环境的状态数据。在处理状态数据时计算的不确定性度量,学习代理的不确定性度量测量认识性的不确定性度量不确定性。在确定不确定性度量超出预定义的阈值时:发送请求来自演示者的动作建议的请求信号;收到了反映了行动建议的建议信号;发送用于实现动作建议的动作信号。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号