Open Theoretical Questions in Reinforcement Learning


Abstract

Reinforcement learning concerns the problem of a learning agent interacting with its environment to achieve a goal. Instead of being given examples of desired behavior, the learning agent must discover by trial and error how to behave in order to get the most reward. The environment is a Markov decision process with state set S and action set A. The agent and the environment interact in a sequence of discrete steps, t = 0, 1, 2, ...
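The interaction described in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not code from the paper: the two-state MDP, its rewards and transition probabilities, and the names `TRANSITIONS`, `step`, `run_episode`, and `random_policy` are all assumptions made up for this example.

```python
import random

# Hypothetical two-state, two-action MDP, invented for illustration only.
STATES = ["s0", "s1"]       # state set S
ACTIONS = ["stay", "move"]  # action set A

# TRANSITIONS[(state, action)] -> list of (probability, next_state, reward)
TRANSITIONS = {
    ("s0", "stay"): [(1.0, "s0", 0.0)],
    ("s0", "move"): [(0.8, "s1", 1.0), (0.2, "s0", 0.0)],
    ("s1", "stay"): [(1.0, "s1", 2.0)],
    ("s1", "move"): [(1.0, "s0", 0.0)],
}


def step(state, action, rng):
    """Sample (next_state, reward) from the transition distribution."""
    outcomes = TRANSITIONS[(state, action)]
    weights = [p for p, _, _ in outcomes]
    _, next_state, reward = rng.choices(outcomes, weights=weights, k=1)[0]
    return next_state, reward


def random_policy(state, rng):
    """Trial-and-error baseline: pick an action uniformly at random."""
    return rng.choice(ACTIONS)


def run_episode(policy, num_steps, seed=0):
    """Run the agent-environment loop for discrete steps t = 0, 1, 2, ..."""
    rng = random.Random(seed)
    state = "s0"
    trajectory = []
    total_reward = 0.0
    for t in range(num_steps):
        action = policy(state, rng)                     # agent acts
        next_state, reward = step(state, action, rng)   # environment responds
        trajectory.append((t, state, action, reward))
        total_reward += reward
        state = next_state
    return trajectory, total_reward


if __name__ == "__main__":
    trajectory, total = run_episode(random_policy, num_steps=10)
    for t, state, action, reward in trajectory:
        print(t, state, action, reward)
    print("total reward:", total)
```

The agent here never learns; it only shows the loop in which a learning rule would sit. A learning agent would replace `random_policy` with a policy that it adjusts from the observed rewards.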


Bibliographic record

Source: Computational Learning Theory, 1999, pp. 11-17 (7 pages)
Conference location: Nordkirchen (DE)
Author: Richard S. Sutton
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
