首页> 外文学位 >Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning.
【24h】

Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning.

机译:使用强化模糊学习在追逃性差分游戏中学习。

获取原文
获取原文并翻译 | 示例

摘要

In this thesis, Q-learning fuzzy inference system is applied to pursuit-evasion differential games. The suggested technique allows both the evader and the pursuer to learn their optimal strategies simultaneously. Reinforcement learning is used to autonomously tune the input parameters and the fuzzy rules of a fuzzy controller for both the evader and the pursuer. We focus more on the behaviours and the strategies of the trained evader. The evader is trained to find its optimal strategy from the received rewards during the game. The homicidal chauffeur game and the game of two cars are used as examples of the method. The simulation results of the suggested technique demonstrate that the trained evader is able to learn its optimal strategies. Furthermore, the learning speed is investigated when using Q-learning with eligibility traces in pursuit-evasion differential games.
机译:本文将Q学习模糊推理系统应用于逃避型差分博弈中。所建议的技术可以使逃避者和追踪者同时学习其最佳策略。强化学习用于自动调整逃避者和追随者的输入参数和模糊控制器的模糊规则。我们更多地关注受过训练的逃避者的行为和策略。逃避者经过训练,可以在游戏过程中从收到的奖励中找到最佳策略。该方法以杀人司机游戏和两辆汽车游戏为例。所建议技术的仿真结果表明,训练有素的逃避者能够学习其最佳策略。此外,在追逃式差分游戏中使用具有资格踪迹的Q学习时,研究了学习速度。

著录项

  • 作者

    Al Faiya, Badr.;

  • 作者单位

    Carleton University (Canada).;

  • 授予单位 Carleton University (Canada).;
  • 学科 Engineering Robotics.;Engineering System Science.
  • 学位 M.A.Sc.
  • 年度 2012
  • 页码 87 p.
  • 总页数 87
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号