
Combining Exploitation-Based and Exploration-Based Approach in Reinforcement Learning

Abstract

Watkins' Q-learning is the most popular and effective model-free method. However, compared with model-based approaches, Q-learning with various exploration strategies requires a large number of trial-and-error interactions to find an optimal policy. To overcome this drawback, we propose a new model-based learning method that extends Q-learning. The method maintains separate EI and ER functions for learning an exploitation-based and an exploration-based model, respectively. The EI function, based on statistics, indicates the best action. The ER function, based on exploration information, leads the learner toward poorly known regions of the global state space by backing up at each step. We also introduce a new criterion that serves as this exploration information. By combining the two functions, we can pursue exploitation and exploration strategies effectively and select an action that takes both strategies into account simultaneously.
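The abstract does not specify the concrete forms of the EI and ER functions or of the exploration criterion. The sketch below is a minimal illustration of the general idea only, assuming tabular values, a hypothetical visit-count-based exploration bonus as the "information of exploration", and a weighting parameter beta; none of these details come from the paper.

```python
import numpy as np

class CombinedAgent:
    """Sketch: keep an exploitation value (EI) and an exploration value (ER)
    per state-action pair, back both up every step, and act greedily on
    their combination so one choice reflects both strategies."""

    def __init__(self, n_states, n_actions, alpha=0.1, gamma=0.95, beta=1.0):
        self.q = np.zeros((n_states, n_actions))       # EI: exploitation values
        self.e = np.ones((n_states, n_actions))        # ER: exploration values
        self.counts = np.zeros((n_states, n_actions))  # visit counts (assumed criterion)
        self.alpha, self.gamma, self.beta = alpha, gamma, beta

    def select_action(self, s):
        # Combine the two value functions into a single score.
        return int(np.argmax(self.q[s] + self.beta * self.e[s]))

    def update(self, s, a, r, s_next):
        self.counts[s, a] += 1
        # EI backup: ordinary Q-learning target on the external reward.
        td_target = r + self.gamma * np.max(self.q[s_next])
        self.q[s, a] += self.alpha * (td_target - self.q[s, a])
        # ER backup: an "exploration reward" that shrinks with visitation,
        # pulling the learner toward poorly known regions of the state space.
        explore_r = 1.0 / np.sqrt(self.counts[s, a])
        er_target = explore_r + self.gamma * np.max(self.e[s_next])
        self.e[s, a] += self.alpha * (er_target - self.e[s, a])
```

A driving loop would repeatedly call select_action, apply the chosen action in the environment, and pass the observed transition to update, so both value functions are backed up on every step.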
