首页>
外国专利>
SYSTEM AND METHOD FOR UNCERTAINTY-BASED ADVICE FOR DEEP REINFORCEMENT LEARNING AGENTS
SYSTEM AND METHOD FOR UNCERTAINTY-BASED ADVICE FOR DEEP REINFORCEMENT LEARNING AGENTS
展开▼
机译:基于不确定性的深度加强学习代理建议的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
Disclosed are systems, methods, and devices for training a learning agent. A learning agent that maintains a reinforcement learning neural network is instantiated. State data reflective of a state of an environment explored by the learning agent is received. An uncertainty metric calculated upon processing the state data, the uncertainty metric measuring epistemic uncertainty of the learning agent. Upon determining that the uncertainty metric exceeds a pre-defined threshold: a request signal requesting an action suggestion from a demonstrator is sent; a suggestion signal reflective of the action suggestion is received; and an action signal to implement the action suggestion is sent.
展开▼