首页>
外国专利>
REINFORCEMENT LEARNING EXPLORATION BY EXPLOITING PAST EXPERIENCES FOR CRITICAL EVENTS
REINFORCEMENT LEARNING EXPLORATION BY EXPLOITING PAST EXPERIENCES FOR CRITICAL EVENTS
展开▼
机译:通过探索关键事件的过去经验来进行强化学习探索
展开▼
页面导航
摘要
著录项
相似文献
摘要
A computer-implemented method is provided for reinforcement learning performed by a processor. The method includes obtaining, from an environment, a given experience that includes an action, a state and a reward. The method further includes storing the given experience in an experience buffer responsive to a value of the reward included in the given experience exceeding a first threshold. The method also includes responsive to obtaining another experience having another reward that less than or equal to the first threshold, searching the experience buffer for a candidate experience with a similar state to the other experience and copying the candidate experience into an event buffer. The method additionally includes during exploration, selecting an action to be taken to the environment from the event buffer with a predetermined probability.
展开▼