首页> 外国专利> - METHOD AND APPARATUS FOR ADAPTIVE MULTI-BATCH EXPERIENCE REPLAY FOR CONTINUOUS ACTION CONTROL

- METHOD AND APPARATUS FOR ADAPTIVE MULTI-BATCH EXPERIENCE REPLAY FOR CONTINUOUS ACTION CONTROL

机译:-用于连续动作控制的自适应多批次体验重放的方法和装置

摘要

An adaptive multi-batch experience replay technique for continuous action space control. In the adaptive multi-batch experience replay (AMBER) method, storing information tuples of samples generated based on the updated policy in a replay memory in multiple batches, random mini-batch Adjusting the size of) to reduce the average importance sampling specific gravity, calculating the average importance sampling specific gravity of each sample batch in the replay memory, for the replay memory, the calculated Dropping a batch having an average importance sampling specific gravity greater than a predetermined batch drop coefficient, and updating parameters by performing random mini-batch sampling based on the batch excluded from the drop, targeting the replay memory. You can.
机译:自适应多批次体验重播技术,用于连续动作空间控制。在自适应多批次体验重播(AMBER)方法中,将基于更新策略生成的样本的信息元组存储在多个批次的重播存储器中,随机进行小批量调整,以减小平均重要性抽样比重,计算重放存储器中每个样本批次的平均重要性抽样比重,对于重放存储器,计算出的平均重要性抽样比重大于预定批次丢弃系数的丢弃批次,并通过执行随机小批量更新参数基于从删除中排除的批次进行采样,以重播内存为目标。您可以。

著录项

  • 公开/公告号KR102103644B1

    专利类型

  • 公开/公告日2020-04-23

    原文格式PDF

  • 申请/专利权人 한국과학기술원;

    申请/专利号KR20180102008

  • 发明设计人 성영철;한승열;

    申请日2018-08-29

  • 分类号G06N20;G06F7/02;G06N3/08;

  • 国家 KR

  • 入库时间 2022-08-21 11:04:51

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号