首页> 外文会议>German Association of Medical Informatics, Biometry and Epidemiology., Annual Meeting >Deep Learning Approaches Outperform Conventional Strategies in DeIdentification of German Medical Reports
【24h】

Deep Learning Approaches Outperform Conventional Strategies in DeIdentification of German Medical Reports

机译:深度学习方法优于德国医疗报告的职外策略

获取原文

摘要

One of the major obstacles for research on German medical reports is the lack of de-identified medical corpora. Previous de-identification tasks focused on non-German medical texts, which raised the demand for an in-depth evaluation of de-identification methods on German medical texts. Because of remarkable advancements in natural language processing using supervised machine learning methods on limited training data, we evaluated them for the first time on German medical reports using our annotated data set consisting of 113 medical reports from the cardiology domain. We applied state-of-the-art deep learning methods using pre-trained models as input to a bidirectional LSTM network and well-established conditional random fields for de-identification of German medical reports. We performed an extensive evaluation for de-identification and multiclass named entity recognition. Using rule based and out of domain machine learning methods as a baseline, the conditional random field improved F2-score from 70 to 93% for de-identification, the neural approach reached 96% in F2-score while keeping balanced precision and recall rates. These results show, that state-of-the-art machine learning methods can play a crucial role in de-identification of German medical reports.
机译:德国医疗报告研究的主要障碍之一是缺乏未确定的医学集团。以前的去识别任务专注于非德国医学文本,这提出了对德国医学文本对去鉴定方法进行深入评估的需求。由于使用监督机器学习方法在有限培训数据上使用监督机器学习方法的显着进步,我们使用由心脏病学域113个医疗报告组成的注释数据集首次在德国医疗报告上评估它们。我们使用预先训练的模型应用了最先进的深度学习方法,作为双向LSTM网络的输入和已建立的条件随机字段,用于去识别德国医疗报告。我们对De-Idite和MultiClass命名实体识别进行了广泛的评估。使用基于规则和域机学习方法作为基线,条件随机场从70%到93%改善了去识别的分数,神经方法在F2分数达到96%,同时保持平衡的精度和召回速率。这些结果表明,最先进的机器学习方法可以在去识别德国医学报告中发挥至关重要的作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号