首页> 外文会议>IEEE EMBS International Conference on Biomedical and Health Informatics >Automated histologic grading from free-text pathology reports using graph-of-words features and machine learning
【24h】

Automated histologic grading from free-text pathology reports using graph-of-words features and machine learning

机译:使用词图功能和机器学习对自由文本病理报告进行自动组织学分级

获取原文

摘要

Traditional n-gram feature representation of freetext documents often fails to capture word ordering and semantics, thus compromising text comprehension. Graph-of-words, a new text representation approach based on graph analytics, is a superior method overcoming the limitations by modeling word co-occurrence. In this study, we present a novel application of graph-of-words text description for automated extraction of histologic grade from unstructured pathology reports. Using 10-fold cross-validation tests, the proposed approach resulted in substantially higher macro and micro-F1 scores with undirected graph-of-words features, compared to traditional bi-gram text features. Our feasibility study demonstrated that graph-of-words is a highly efficient method of text comprehension for information extraction from free-text clinical documents.
机译:自由文本文档的传统n-gram特征表示通常无法捕获单词顺序和语义,从而损害了文本理解能力。词图(Graph-of-words)是一种基于图分析的新文本表示方法,是一种通过对词共现进行建模来克服局限性的高级方法。在这项研究中,我们提出了词图文字说明在从非结构化病理报告中自动提取组织学等级的新应用。与传统的Bi-gram文字功能相比,使用10倍交叉验证测试,所提出的方法所产生的宏观和微观F1分数具有无方向的字图特征。我们的可行性研究表明,单词图是从自由文本临床文档中提取信息的高效文本理解方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号