Using machine learning for predicting cervical cancer from Swedish electronic health records by mining hierarchical representations

Rebecka Weegar; Karin Sundstr?m

首页> 外文期刊>PLoS One >Using machine learning for predicting cervical cancer from Swedish electronic health records by mining hierarchical representations

【24h】

Using machine learning for predicting cervical cancer from Swedish electronic health records by mining hierarchical representations

机译：采用机器学习通过采矿等级表示从瑞典电子健康记录预测宫颈癌

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Electronic health records (EHRs) contain rich documentation regarding disease symptoms and progression, but EHR data is challenging to use for diagnosis prediction due to its high dimensionality, relative scarcity, and substantial level of noise. We investigated how to best represent EHR data for predicting cervical cancer, a serious disease where early detection is beneficial for the outcome of treatment. A case group of 1321 patients with cervical cancer were matched to ten times as many controls, and for both groups several types of events were extracted from their EHRs. These events included clinical codes, lab results, and contents of free text notes retrieved using a LSTM neural network. Clinical events are described with great variation in EHR texts, leading to a very large feature space. Therefore, an event hierarchy inferred from the textual events was created to represent the clinical texts. Overall, the events extracted from free text notes contributed the most to the final prediction, and the hierarchy of textual events further improved performance. Four classifiers were evaluated for predicting a future cancer diagnosis where Random Forest achieved the best results with an AUC of 0.70 from a year before diagnosis up to 0.97 one day before diagnosis. We conclude that our approach is sound and had excellent discrimination at diagnosis, but only modest discrimination capacity before this point. Since our study objective was earlier disease prediction than such, we propose further work should consider extending patient histories through e.g. the integration of primary health records preceding referral to hospital.

机译：电子健康记录（EHRS）包含有关疾病症状和进展的丰富文档，但由于其高维度，相对稀缺和大量噪声水平，EHR数据用于诊断预测。我们调查了如何最好地代表预测宫颈癌的EHR数据，这是一种严重的疾病，早期检测对于治疗结果有益。一个宫颈癌患者的案例组与许多对照的十倍次，并且对于这两个组，从其EHR中提取了几种类型的事件。这些事件包括使用LSTM神经网络检索的临床代码，实验室结果和自由文本笔记的内容。临床事件描述了EHR文本的巨大变化，导致非常大的特征空间。因此，创建从文本事件推断的事件层次结构以表示临床文本。总的来说，从自由文本笔记中提取的事件为最终预测贡献了最大的预测，以及文本事件的层次结构进一步提高了性能。评估了四种分类剂，以预测未来癌症诊断，其中随机森林从一年前诊断前一年的AUC达到了最佳效果，每天诊断前一天达到0.97。我们得出结论，我们的方法是声音，在诊断中具有良好的歧视，但在这一点之前只有适度的歧视能力。由于我们的研究目标是早期的疾病预测，我们提出进一步的工作应考虑通过例如延伸患者历史。转介前往医院的初级健康记录的整合。

著录项

来源
《PLoS One》 |2020年第8期|共19页
作者
Rebecka Weegar; Karin Sundstr?m;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类医药、卫生;
关键词

相似文献

外文文献
中文文献
专利

1. Data Mining in Spine Surgery: Leveraging Electronic Health Records for Machine Learning and Clinical Research [J] . Victor E. Staartjes, Martin N. Stienen Neurospine. . 2019,第4期

机译：脊柱外科的数据挖掘：利用电子健康记录进行机器学习和临床研究
2. A hybrid machine learning framework to predict mortality in paralytic ileus patients using electronic health records (EHRs) [J] . Ahmad Fahad Shabbir, Ali Liaqat, Raza-Ul-Mustafa, Journal of ambient intelligence and humanized computing . 2021,第3期

机译：混合机器学习框架，以预测利用电子健康记录（EHRS）的麻痹性髂骨患者死亡率
3. Predicting the Risk of Inpatient Hypoglycemia With Machine Learning Using Electronic Health Records [J] . Diabetes care . 2020,第7期

机译：使用电子健康记录预测机器学习的住院性低血糖的风险
4. Applying deep learning on electronic health records in Swedish to predict healthcare-associated infections [C] . Olof Jacobson, Hercules Dalianis 15th workshop on biomedical natural language processing . 2016

机译：在瑞典人的电子健康记录中应用深度学习来预测与医疗保健相关的感染
5. Predicting Drug Misuse Status Using Machine Learning on Electronic Health Records [D] . Kania, Robert. 2020

机译：使用机器学习在电子健康记录上预测药物滥用状态
6. Using machine learning for predicting cervical cancer from Swedish electronic health records by mining hierarchical representations [O] . Rebecka Weegar, Karin Sundström 2020

机译：利用机器学习通过采矿等级表示从瑞典电子健康记录预测宫颈癌
7. Machine learning model to predict mental health crisis from electronic health records [O] . Roger Garriga, Aleksandar Matić, Javier Mas, 2021

机译：机器学习模式预测电子健康记录心理健康危机

Using machine learning for predicting cervical cancer from Swedish electronic health records by mining hierarchical representations

摘要

著录项

相似文献

相关主题

期刊订阅