JMLR: Workshop and Conference Proceedings

Learning to Explore via Meta-Policy Gradient


Abstract

The performance of off-policy learning, including deep Q-learning and deep deterministic policy gradient (DDPG), critically depends on the choice of the exploration policy. Existing exploration methods are mostly based on adding noise to the on-going actor policy and can only explore local regions close to what the actor policy dictates. In this work, we develop a simple meta-policy gradient algorithm that allows us to adaptively learn the exploration policy in DDPG. Our algorithm allows us to train flexible exploration behaviors that are independent of the actor policy, yielding a global exploration that significantly speeds up the learning process. With an extensive study, we show that our method significantly improves the sample-efficiency of DDPG on a variety of reinforcement learning continuous control tasks.
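
The abstract describes the method only at a high level, so the following is a minimal, self-contained sketch of the meta-policy gradient loop it suggests, not the paper's implementation. It assumes one natural reading of the abstract: the exploration policy's meta-reward is the improvement in the actor's return after the learner trains on the batch that the exploration policy collected. The one-step quadratic task, the `ddpg_style_update` helper (a scalar actor plus a quadratic critic standing in for full DDPG), and hyperparameters such as `meta_lr` and `batch_size` are all illustrative choices, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy one-step continuous-control task standing in for the RL environment:
# the agent picks a scalar action a and receives reward -(a - 2)^2.
def env_reward(a):
    return -(a - 2.0) ** 2

# --- Simplified stand-in for the DDPG learner: a deterministic scalar actor
# theta and a quadratic critic Q(a) = c2*a^2 + c1*a + c0 fit by least squares.
theta = -1.0            # actor's current deterministic action
actor_lr = 0.1

def ddpg_style_update(theta, actions, rewards, steps=5):
    """Fit the critic on the exploration batch, then ascend dQ/da at a = theta."""
    c2, c1, _ = np.polyfit(actions, rewards, 2)
    for _ in range(steps):
        theta += actor_lr * (2.0 * c2 * theta + c1)  # deterministic policy gradient
    return theta

# --- Exploration policy: a Gaussian N(mu, sigma^2), independent of the actor,
# whose parameters are themselves learned with REINFORCE on the meta-reward.
explore_mu, explore_log_std = 0.0, np.log(1.0)
meta_lr, batch_size = 0.02, 16

for it in range(200):
    sigma = max(np.exp(explore_log_std), 0.1)  # floor keeps the critic fit well-conditioned

    # 1. The exploration policy gathers a batch of data (the actor is not perturbed).
    actions = explore_mu + sigma * rng.normal(size=batch_size)
    rewards = env_reward(actions)

    # 2. Train the learner on that data; the meta-reward is the improvement in the
    #    actor's evaluation return caused by this batch.
    before = env_reward(theta)
    theta = ddpg_style_update(theta, actions, rewards)
    meta_reward = env_reward(theta) - before

    # 3. Meta-policy gradient: REINFORCE update of the exploration parameters,
    #    treating the whole batch as one "episode" scored by meta_reward.
    grad_mu = np.mean((actions - explore_mu) / sigma ** 2)
    grad_log_std = np.mean((actions - explore_mu) ** 2 / sigma ** 2 - 1.0)
    explore_mu += meta_lr * meta_reward * grad_mu
    explore_log_std += meta_lr * meta_reward * grad_log_std

    if it % 50 == 0:
        print(f"iter {it:3d}  actor action {theta:+.3f}  "
              f"explore mu {explore_mu:+.3f}  sigma {sigma:.3f}")
```

The sketch tries to expose the separation the abstract emphasizes: the Gaussian exploration parameters are updated only through the meta-reward, never by adding noise to the actor, so the exploration distribution is free to move toward regions far from what the current actor dictates.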
