Exploring Feature Dimensions to Learn a New Policy in an Uninformed Reinforcement Learning Task

Oh-hyeon Choung; Sang Wan Lee; Yong Jeong

Abstract

When making a choice with limited information, we explore new features through trial-and-error to learn how they are related. However, few studies have investigated exploratory behaviour when information is limited. In this study, we address, at both the behavioural and neural level, how, when, and why humans explore new feature dimensions to learn a new policy for choosing a state-space. We designed a novel multi-dimensional reinforcement learning task to encourage participants to explore and learn new features, then used a reinforcement learning algorithm to model policy exploration and learning behaviour. Our results provide the first evidence that, when humans explore new feature dimensions, their values are transferred from the previous policy to the new online (active) policy, as opposed to being learned from scratch. We further demonstrated that exploration may be regulated by the level of cognitive ambiguity, and that this process might be controlled by the frontopolar cortex. This opens up new possibilities of further understanding how humans explore new features in an open-space with limited information.
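The contrast the abstract draws between transferring values and learning them from scratch can be illustrated with a minimal sketch. This is not the authors' model: the delta-rule update, the feature names (colour as the known dimension, shape as the newly explored one), the learning rate, and the function names `init_new_policy` and `update` are all assumptions chosen for illustration.

```python
from typing import Dict, List, Tuple

# A stimulus in the new policy: (known feature level, newly explored feature level).
Stimulus = Tuple[str, str]


def init_new_policy(
    old_values: Dict[str, float],
    new_feature_levels: List[str],
    transfer: bool,
) -> Dict[Stimulus, float]:
    """Build the value table for a policy that adds one feature dimension.

    With transfer=True each new entry inherits the value learned under the
    old policy; with transfer=False every entry starts at zero (from scratch).
    """
    new_values: Dict[Stimulus, float] = {}
    for old_level, old_value in old_values.items():
        for new_level in new_feature_levels:
            new_values[(old_level, new_level)] = old_value if transfer else 0.0
    return new_values


def update(
    values: Dict[Stimulus, float],
    stimulus: Stimulus,
    reward: float,
    alpha: float = 0.5,
) -> float:
    """Hypothetical delta-rule update: move the value toward the reward."""
    values[stimulus] += alpha * (reward - values[stimulus])
    return values[stimulus]


# Values learned under the old, colour-only policy (illustrative numbers).
old_policy = {"red": 0.5, "blue": 0.0}
shapes = ["circle", "square"]

transferred = init_new_policy(old_policy, shapes, transfer=True)
scratch = init_new_policy(old_policy, shapes, transfer=False)

# One rewarded trial on the same stimulus under each initialisation.
update(transferred, ("red", "circle"), reward=1.0)
update(scratch, ("red", "circle"), reward=1.0)

print(transferred[("red", "circle")], scratch[("red", "circle")])  # 0.75 0.5
```

In this toy setting the transferred table starts closer to the rewarded value, so the same single trial leaves it with a higher estimate than the table initialised from scratch.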


Bibliographic record

Source: Scientific Reports, 2017, issue 1
Authors: Oh-hyeon Choung; Sang Wan Lee; Yong Jeong
Original format: PDF

