首页> 美国卫生研究院文献>Frontiers in Neuroscience >A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces
【2h】

A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces

机译:在基于行为者的强化学习的脑机界面中使用神经生物学反馈的置信度度量

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Brain-Machine Interfaces (BMIs) can be used to restore function in people living with paralysis. Current BMIs require extensive calibration that increase the set-up times and external inputs for decoder training that may be difficult to produce in paralyzed individuals. Both these factors have presented challenges in transitioning the technology from research environments to activities of daily living (ADL). For BMIs to be seamlessly used in ADL, these issues should be handled with minimal external input thus reducing the need for a technician/caregiver to calibrate the system. Reinforcement Learning (RL) based BMIs are a good tool to be used when there is no external training signal and can provide an adaptive modality to train BMI decoders. However, RL based BMIs are sensitive to the feedback provided to adapt the BMI. In actor-critic BMIs, this feedback is provided by the critic and the overall system performance is limited by the critic accuracy. In this work, we developed an adaptive BMI that could handle inaccuracies in the critic feedback in an effort to produce more accurate RL based BMIs. We developed a confidence measure, which indicated how appropriate the feedback is for updating the decoding parameters of the actor. The results show that with the new update formulation, the critic accuracy is no longer a limiting factor for the overall performance. We tested and validated the system onthree different data sets: synthetic data generated by an Izhikevich neural spiking model, synthetic data with a Gaussian noise distribution, and data collected from a non-human primate engaged in a reaching task. All results indicated that the system with the critic confidence built in always outperformed the system without the critic confidence. Results of this study suggest the potential application of the technique in developing an autonomous BMI that does not need an external signal for training or extensive calibration.
机译:脑机接口(BMI)可用于恢复瘫痪患者的功能。当前的BMI需要大量的校准,这会增加设置时间,并需要外部输入以进行解码器训练,而这在瘫痪的个体中可能很难产生。这两个因素都给将技术从研究环境过渡到日常生活活动(ADL)带来了挑战。对于要在ADL中无缝使用的BMI,应使用最少的外部输入来处理这些问题,从而减少技术人员/护理人员对系统进行校准的需求。当没有外部训练信号时,基于强化学习(RL)的BMI是一个很好的工具,并且可以提供自适应模式来训练BMI解码器。但是,基于RL的BMI对提供的适应BMI的反馈很敏感。在批评演员的BMI中,此反馈是由评论者提供的,整个系统的性能受到评论者准确性的限制。在这项工作中,我们开发了一种自适应BMI,该BMI可以处理评论者反馈中的不正确之处,以期产生更准确的基于RL的BMI。我们开发了一种置信度度量,该度量表明反馈对于更新actor的解码参数有多合适。结果表明,使用新的更新公式,批注者的准确性不再是整体性能的限制因素。我们通过三种不同的数据集对系统进行了测试和验证:由Izhikevich神经峰值模型生成的合成数据,具有高斯噪声分布​​的合成数据以及从从事到达任务的非人类灵长类动物收集的数据。所有结果表明,建立了带有批评者信心的系统总是优于没有批评者信心的系统。这项研究的结果表明该技术在开发不需要外部信号进行训练或广泛校准的自主BMI中的潜在应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号