...
首页> 外文期刊>Artificial intelligence in medicine >Transatlantic transferability of a new reinforcement learning model for optimizing haemodynamic treatment for critically ill patients with sepsis
【24h】

Transatlantic transferability of a new reinforcement learning model for optimizing haemodynamic treatment for critically ill patients with sepsis

机译:一种新的加固学习模型优化血腥患者血清患者的新加固学习模型的跨大海可转移性

获取原文
获取原文并翻译 | 示例
           

摘要

Introduction: In recent years, reinforcement learning (RL) has gained traction in the healthcare domain. In particular, RL methods have been explored for haemodynamic optimization of septic patients in the Intensive Care Unit. Most hospitals however, lack the data and expertise for model development, necessitating transfer of models developed using external datasets. This approach assumes model generalizability across different patient populations, the validity of which has not previously been tested. In addition, there is limited knowledge on safety and reliability. These challenges need to be addressed to further facilitate implementation of RL models in clinical practice.Method: We developed and validated a new reinforcement learning model for hemodynamic optimization in sepsis on the MIMIC intensive care database from the USA using a dueling double deep Q network. We then transferred this model to the European AmsterdamUMCdb intensive care database. T-Distributed Stochastic Neighbor Embedding and Sequential Organ Failure Assessment scores were used to explore the differences between the patient populations. We apply off-policy policy evaluation methods to quantify model performance. In addition, we introduce and apply a novel deep policy inspection to analyse how the optimal policy relates to the different phases of sepsis and sepsis treatment to provide interpretable insight in order to assess model safety and reliability.Results: The off-policy evaluation revealed that the optimal policy outperformed the physician policy on both datasets despite marked differences between the two patient populations and physician's policies. Our novel deep policy inspection method showed insightful results and unveiled that the model could initiate therapy adequately and adjust therapy intensity to illness severity and disease progression which indicated safe and reliable model behaviour. Compared to current physician behavior, the developed policy prefers a more liberal use of vasopressors with a more restrained use of fluid therapy in line with previous work.Conclusion: We created a reinforcement learning model for optimal bedside hemodynamic management and demonstrated model transferability between populations from the USA and Europe for the first time. We proposed new methods for deep policy inspection integrating expert domain knowledge. This is expected to facilitate progression to bedside clinical decision support for the treatment of critically ill patients.
机译:介绍:近年来,加固学习(RL)在医疗领域获得了牵引力。特别是,RL方法已经探讨了重症监护单元中脓毒症患者的血流动力学优化。然而,大多数医院缺乏模型开发的数据和专业知识,需要使用外部数据集开发的模型的转移。该方法假设模拟不同患者群体的普遍性,其有效性尚未测试过。此外,关于安全性和可靠性有限。需要解决这些挑战,以进一步促进临床实践中的RL模型。方法:我们使用Dueling Double Double Q网络从美国模仿密集护理数据库的败血力优化的新加固学习模型。然后,我们将此模型转移到欧洲Amsterdamumcdb重症监护数据库。 T分布式随机邻居嵌入和顺序器官失效评估分数用于探讨患者人群之间的差异。我们申请违规策略评估方法来量化模型性能。此外,我们介绍并应用了一个新的深度政策检查,分析了最佳政策如何与败血症和败血症治疗的不同阶段有关,以提供可解释的洞察力,以评估模型安全性和可靠性。结果:违规评估显示尽管两名患者人口和医师的政策有显着差异,但最佳政策可能表现出对两种数据集的医生政策。我们的新型深度政策检测方法显示了富有识别结果,推出了该模型可以充分发起治疗,并调整治疗强度与疾病严重程度和疾病进展,这表明了安全可靠的模型行为。与目前的医生行为相比,发达的政策更让血管加压器更加自由地使用液体治疗液体治疗符合以前的工作。结论:我们创建了一种加强学习模型,可实现最佳床边血液动力学管理,并展示人口之间的模型可转化性美国和欧洲第一次。我们提出了新的深度政策检查方法,整合专家领域知识。这有望促进对患者治疗患者的睡眠临床决策的进展。

著录项

  • 来源
    《Artificial intelligence in medicine》 |2021年第2期|102003.1-102003.17|共17页
  • 作者单位

    Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands|Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands|Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

    Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Sepsis; Reinforcement learning; Deep Q learning; ICU;

    机译:脓毒症;强化学习;深Q学习;ICU;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号