首页> 外文会议>IEEE EMBS International Conference on Biomedical and Health Informatics >Automated identification of pediatric appendicitis score in emergency department notes using natural language processing
【24h】

Automated identification of pediatric appendicitis score in emergency department notes using natural language processing

机译:使用自然语言处理功能自动识别急诊科笔记中的小儿阑尾炎评分

获取原文

摘要

Objective: The goal of this project was development of a software tool to detect documentation of Pediatric Appendicitis Score (PAS) within electronic emergency department (ED) notes. The overarching purpose was assessment of diagnostic imaging practices when PAS falls outside of a certain range, since minimizing patients' radiation exposure is desired. Methods: 15074 ED notes were collected from visits between July 2011-Aug. 2016. Notes were labeled as having PAS documented (PAS+) or not (PAS-). 12562 semistructured notes were split into 60% training, 20% validation, and 20% testing. An automated procedure was developed to label data, preprocess notes, extract features, construct three classification models, and compare the models. The selected model was also evaluated on a second testing set of 2512 hand-labeled (BN) unstructured notes using F1-score. Results: The Logistic Regression (LR) model was selected for best F1-score on the validation set (0.9874). This model's F1-score on the human-labeled testing set of unstructured data (0.8391) outperformed the previous method (0.3435). Discussion: The selected LR model demonstrated an improvement upon the previous method when evaluated on manually labeled unstructured data (no overlap in 95% CI). Conclusion: While the LR classifier was trained and selected in an automated way, it still performed well compared to human performance. This tool can be used to expedite manual chart review for identification of PAS within ED notes.
机译:目的:该项目的目标是开发一种软​​件工具,以检测电子急诊科(ED)笔记中的小儿阑尾炎评分(PAS)文档。总体目的是评估PAS超出一定范围时的诊断成像实践,因为需要最大程度地减少患者的辐射暴露。方法:从2011年7月至8月之间的访问中收集了15074 ED笔记。 2016年。注释被标记为已记录PAS(PAS +)或未记录(PAS-)。 12562个半结构化笔记分为60%训练,20%验证和20%测试。开发了一种自动化程序来标记数据,预处理注释,提取特征,构建三个分类模型并进行比较。所选模型还使用F1评分在2512个手工标记(BN)非结构化票据的第二个测试集上进行了评估。结果:Logistic回归(LR)模型被选为验证集中(0.9874)的最佳F1得分。该模型在人工标记的非结构化数据(0.8391)上的F1评分优于以前的方法(0.3435)。讨论:当对手动标记的非结构化数据进行评估(95%CI没有重叠)时,所选的LR模型证明了对先前方法的改进。结论:虽然LR分类器是通过自动方式训练和选择的,但与人类的表现相比,它仍然表现良好。此工具可用于加快手动图表检查的速度,以识别ED注释中的PAS。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号