Transatlantic transferability of a new reinforcement learning model for optimizing haemodynamic treatment for critically ill patients with sepsis

Roggeveen Luca; el Hassouni Ali; Ahrendt Jonas; Guo Tingjie; Fleuren Lucas; Thoral Patrick; Girbes Armand R. J.; Hoogendoorn Mark; Elbers Paul W. G.

首页> 外文期刊>Artificial intelligence in medicine >Transatlantic transferability of a new reinforcement learning model for optimizing haemodynamic treatment for critically ill patients with sepsis

【24h】

Transatlantic transferability of a new reinforcement learning model for optimizing haemodynamic treatment for critically ill patients with sepsis

机译：一种新的加固学习模型优化血腥患者血清患者的新加固学习模型的跨大海可转移性

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Introduction: In recent years, reinforcement learning (RL) has gained traction in the healthcare domain. In particular, RL methods have been explored for haemodynamic optimization of septic patients in the Intensive Care Unit. Most hospitals however, lack the data and expertise for model development, necessitating transfer of models developed using external datasets. This approach assumes model generalizability across different patient populations, the validity of which has not previously been tested. In addition, there is limited knowledge on safety and reliability. These challenges need to be addressed to further facilitate implementation of RL models in clinical practice.Method: We developed and validated a new reinforcement learning model for hemodynamic optimization in sepsis on the MIMIC intensive care database from the USA using a dueling double deep Q network. We then transferred this model to the European AmsterdamUMCdb intensive care database. T-Distributed Stochastic Neighbor Embedding and Sequential Organ Failure Assessment scores were used to explore the differences between the patient populations. We apply off-policy policy evaluation methods to quantify model performance. In addition, we introduce and apply a novel deep policy inspection to analyse how the optimal policy relates to the different phases of sepsis and sepsis treatment to provide interpretable insight in order to assess model safety and reliability.Results: The off-policy evaluation revealed that the optimal policy outperformed the physician policy on both datasets despite marked differences between the two patient populations and physician's policies. Our novel deep policy inspection method showed insightful results and unveiled that the model could initiate therapy adequately and adjust therapy intensity to illness severity and disease progression which indicated safe and reliable model behaviour. Compared to current physician behavior, the developed policy prefers a more liberal use of vasopressors with a more restrained use of fluid therapy in line with previous work.Conclusion: We created a reinforcement learning model for optimal bedside hemodynamic management and demonstrated model transferability between populations from the USA and Europe for the first time. We proposed new methods for deep policy inspection integrating expert domain knowledge. This is expected to facilitate progression to bedside clinical decision support for the treatment of critically ill patients.

机译：介绍：近年来，加固学习（RL）在医疗领域获得了牵引力。特别是，RL方法已经探讨了重症监护单元中脓毒症患者的血流动力学优化。然而，大多数医院缺乏模型开发的数据和专业知识，需要使用外部数据集开发的模型的转移。该方法假设模拟不同患者群体的普遍性，其有效性尚未测试过。此外，关于安全性和可靠性有限。需要解决这些挑战，以进一步促进临床实践中的RL模型。方法：我们使用Dueling Double Double Q网络从美国模仿密集护理数据库的败血力优化的新加固学习模型。然后，我们将此模型转移到欧洲Amsterdamumcdb重症监护数据库。 T分布式随机邻居嵌入和顺序器官失效评估分数用于探讨患者人群之间的差异。我们申请违规策略评估方法来量化模型性能。此外，我们介绍并应用了一个新的深度政策检查，分析了最佳政策如何与败血症和败血症治疗的不同阶段有关，以提供可解释的洞察力，以评估模型安全性和可靠性。结果：违规评估显示尽管两名患者人口和医师的政策有显着差异，但最佳政策可能表现出对两种数据集的医生政策。我们的新型深度政策检测方法显示了富有识别结果，推出了该模型可以充分发起治疗，并调整治疗强度与疾病严重程度和疾病进展，这表明了安全可靠的模型行为。与目前的医生行为相比，发达的政策更让血管加压器更加自由地使用液体治疗液体治疗符合以前的工作。结论：我们创建了一种加强学习模型，可实现最佳床边血液动力学管理，并展示人口之间的模型可转化性美国和欧洲第一次。我们提出了新的深度政策检查方法，整合专家领域知识。这有望促进对患者治疗患者的睡眠临床决策的进展。

著录项

来源
《Artificial intelligence in medicine》 |2021年第2期|102003.1-102003.17|共17页
作者
Roggeveen Luca; el Hassouni Ali; Ahrendt Jonas; Guo Tingjie; Fleuren Lucas; Thoral Patrick; Girbes Armand R. J.; Hoogendoorn Mark; Elbers Paul W. G.;
展开▼
作者单位

Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands|Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands|Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Dept Comp Sci Quantitat Data Analyt Grp De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

Vrije Univ Amsterdam Infect & Immun Inst AI&II Dept Intens Care Med Amsterdam Med Data Sci AMDS Amsterdam UMC Locat VUmc Res VUmc Intens Care Rev De Boelelaan 1117 NL-1081 HV Amsterdam Netherlands;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Sepsis; Reinforcement learning; Deep Q learning; ICU;

机译：脓毒症;强化学习;深Q学习;ICU;

相似文献

外文文献
中文文献
专利

1. Population Pharmacokinetics and Pharmacodynamics Modeling To Optimize Dosage Regimens of Sulbactam in Critically Ill Patients with Severe Sepsis Caused by Acinetobacter baumannii [J] . Jaruratanasirikul Sutep, Wongpoowarak Wibul, Wattanavijitkul Thitima, Antimicrobial agents and chemotherapy. . 2016,第12期

机译：人群药代动力学和药效学模型可优化鲍曼不动杆菌引起的重症脓毒症重症患者的舒巴坦剂量方案
2. A Machine-Learning Approach for Dynamic Prediction of Sepsis-Induced Coagulopathy in Critically Ill Patients With Sepsis [J] . Qin-Yu Zhao, Le-Ping Liu, Jing-Chao Luo, Frontiers in Medicine . 2020,第a期

机译：一种机器学习方法，用于脓毒症患者患者脓毒症诱导的凝血病变的动态预测
3. Management of sepsis and septic shock in critically ill patients transferred by a dedicated transport team in the West of Scotland [J] . A Cadamy, A Binning, I Thomson Critical care : . 2006,第Suppla1期

机译：由苏格兰西部的专门运输团队转移的重症患者的败血症和败血性休克的管理
4. Using Machine Learning to Diagnose Bacterial Sepsis in the Critically Ill Patients [C] . Yang Liu, Kup-Sze Choi International conference on smart health . 2017

机译：使用机器学习诊断重症患者的细菌性败血症
5. Patients with advanced heart failure: A description of the disease trajectory in the study to understand prognoses and preferences for outcomes and risks of treatment in critically ill adults (SUPPORT) [D] . Castle, Jane Elizabeth 2000

机译：晚期心力衰竭患者：描述研究中的疾病轨迹，以了解重症成人的预后和对结局和治疗风险的偏好（支持）
6. Population Pharmacokinetics and Pharmacodynamics Modeling To Optimize Dosage Regimens of Sulbactam in Critically Ill Patients with Severe Sepsis Caused by Acinetobacter baumannii [O] . Sutep Jaruratanasirikul, Wibul Wongpoowarak, Thitima Wattanavijitkul, 2016

机译：人群药代动力学和药效学模型可优化鲍曼不动杆菌引起的重症脓毒症重症患者的舒巴坦剂量方案
7. Insulin sensitivity and sepsis score: A correlation between model-based metric and sepsis scoring system in critically ill patients [O] . Fatanah M. Suhaimi, J. Geoffrey Chase, Christopher G. Pretty, 2017

机译：胰岛素敏感性和败血症得分：患有危重病患者模型的度量和脓毒症评分系统的相关性

Transatlantic transferability of a new reinforcement learning model for optimizing haemodynamic treatment for critically ill patients with sepsis

摘要

著录项

相似文献

相关主题

期刊订阅