首页> 美国卫生研究院文献>Journal of the American Medical Informatics Association : JAMIA >Use of General-purpose Negation Detection to Augment Concept Indexing of Medical Documents
【2h】

Use of General-purpose Negation Detection to Augment Concept Indexing of Medical Documents

机译:使用通用否定检测对医学文件进行增强概念索引

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Objectives: To test the hypothesis that most instances of negated concepts in dictated medical documents can be detected by a strategy that relies on tools developed for the parsing of formal (computer) languages—specifically, a lexical scanner (“lexer”) that uses regular expressions to generate a finite state machine, and a parser that relies on a restricted subset of context-free grammars, known as LALR(1) grammars.>Methods: A diverse training set of 40 medical documents from a variety of specialties was manually inspected and used to develop a program (Negfinder) that contained rules to recognize a large set of negated patterns occurring in the text. Negfinder's lexer and parser were developed using tools normally used to generate programming language compilers. The input to Negfinder consisted of medical narrative that was preprocessed to recognize UMLS concepts: the text of a recognized concept had been replaced with a coded representation that included its UMLS concept ID. The program generated an index with one entry per instance of a concept in the document, where the presence or absence of negation of that concept was recorded. This information was used to mark up the text of each document by color-coding it to make it easier to inspect. The parser was then evaluated in two ways: 1) a test set of 60 documents (30 discharge summaries, 30 surgical notes) marked-up by Negfinder was inspected visually to quantify false-positive and false-negative results; and 2) a different test set of 10 documents was independently examined for negatives by a human observer and by Negfinder, and the results were compared.>Results: In the first evaluation using marked-up documents, 8,358 instances of UMLS concepts were detected in the 60 documents, of which 544 were negations detected by the program and verified by human observation (true-positive results, or TPs). Thirteen instances were wrongly flagged as negated (false-positive results, or FPs), and the program missed 27 instances of negation (false-negative results, or FNs), yielding a sensitivity of 95.3 percent and a specificity of 97.7 percent. In the second evaluation using independent negation detection, 1,869 concepts were detected in 10 documents, with 135 TPs, 12 FPs, and 6 FNs, yielding a sensitivity of 95.7 percent and a specificity of 91.8 percent. One of the words “no,” “denies/denied,” “not,” or “without” was present in 92.5 percent of all negations.>Conclusions: Negation of most concepts in medical narrative can be reliably detected by a simple strategy. The reliability of detection depends on several factors, the most important being the accuracy of concept matching.
机译:>目标:为了检验这一假设,即可以通过一种策略来检测口述医学文档中大多数否定概念的实例,该策略依赖于为解析正式​​(计算机)语言而开发的工具,特别是词法扫描器( “ lexer”),使用正则表达式生成有限状态机,以及使用依赖于上下文无关文法的受限子集的解析器,称为LALR(1)文法。>方法:手动检查了一组来自各个专业的40份医学文档,并将其用于开发程序(Negfinder),该程序包含用于识别文本中出现的大量否定模式的规则。 Negfinder的词法分析器和解析器是使用通常用于生成编程语言编译器的工具开发的。 Negfinder的输入内容包括经过处理以识别UMLS概念的医学叙述:已识别概念的文本已替换为包含UMLS概念ID的编码表示形式。该程序生成一个索引,其中每个概念实例在文档中都有一个条目,其中记录了该概念是否存在否定。此信息用于通过彩色编码来标记每个文档的文本,以使其更易于检查。然后以两种方式评估分析器:1)目视检查由Negfinder标记的60个文档(30个出院总结,30个手术记录)的测试集,以量化假阳性和假阴性结果; 2)由人类观察者和Negfinder独立检查了10个文档的不同测试集的阴性结果,并比较了结果。>结果:在第一次使用带标记的文档的评估中,有8358个实例在60份文档中检测到UMLS概念中的UMLS概念,其中544项是由程序检测到的否定,并通过人工观察(真阳性结果或TP)进行了验证。 13个实例被错误地标记为否定(假阳性结果或FP),并且该程序错过了27个否定实例(假阴性结果或FN),灵敏度为95.3%,特异性为97.7%。在第二次使用独立否定检测的评估中,在10个文档中检测到1,869个概念,其中包含135个TP,12个FP和6个FN,灵敏度为95.7%,特异性为91.8%。在所有否定词中,有92.5%的人使用“否”,“否定/否定”,“否”或“没有”一词。>结论:对医学叙事中大多数概念的否定是可靠的通过简单的策略进行检测。检测的可靠性取决于几个因素,最重要的是概念匹配的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号