Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems


Abstract

Natural language descriptions of organismal phenotypes, a principal object of study in biology, are abundant in the biological literature. Expressing these phenotypes as logical statements using ontologies would enable large-scale analysis on phenotypic information from diverse systems. However, considerable human effort is required to make these phenotype descriptions amenable to machine reasoning. Natural language processing tools have been developed to facilitate this task, and the training and evaluation of these tools depend on the availability of high quality, manually annotated gold standard data sets. We describe the development of an expert-curated gold standard data set of annotated phenotypes for evolutionary biology. The gold standard was developed for the curation of complex comparative phenotypes for the Phenoscape project. It was created by consensus among three curators and consists of entity–quality expressions of varying complexity. We use the gold standard to evaluate annotations created by human curators and those generated by the Semantic CharaParser tool. Using four annotation accuracy metrics that can account for any level of relationship between terms from two phenotype annotations, we found that machine–human consistency, or similarity, was significantly lower than inter-curator (human–human) consistency. Surprisingly, allowing curators access to external information did not significantly increase the similarity of their annotations to the gold standard or have a significant effect on inter-curator consistency. We found that the similarity of machine annotations to the gold standard increased after new relevant ontology terms had been added. Evaluation by the original authors of the character descriptions indicated that the gold standard annotations came closer to representing their intended meaning than did either the curator or machine annotations.
These findings point toward ways to better design software to augment human curators and the use of the gold standard corpus will allow training and assessment of new tools to improve phenotype annotation accuracy at scale.
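
The abstract does not name the four accuracy metrics. As a rough illustration of how a metric can "account for any level of relationship between terms", the sketch below computes a Jaccard similarity over ancestor sets, one common choice for ontology terms and not necessarily one of the metrics the authors used. The toy hierarchy, the term names, and the functions `ancestors` and `jaccard` are all hypothetical and are not taken from the paper or from any real ontology.

```python
# Toy is_a hierarchy (child -> list of parents).
# ASSUMPTION: these terms are made up for illustration only.
PARENTS = {
    "pectoral fin": ["fin"],
    "pelvic fin": ["fin"],
    "fin": ["appendage"],
    "appendage": ["anatomical structure"],
    "anatomical structure": [],
}


def ancestors(term, parents=PARENTS):
    """Return the term itself plus every term reachable via is_a links."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, []))
    return seen


def jaccard(term_a, term_b, parents=PARENTS):
    """Jaccard similarity of the two terms' ancestor sets (1.0 = identical)."""
    set_a = ancestors(term_a, parents)
    set_b = ancestors(term_b, parents)
    return len(set_a & set_b) / len(set_a | set_b)


if __name__ == "__main__":
    # Sibling terms share most ancestors, so they score well above zero
    # even though the annotations are not an exact match.
    print(jaccard("pectoral fin", "pelvic fin"))
    print(jaccard("pectoral fin", "fin"))
    print(jaccard("pectoral fin", "anatomical structure"))
```

Under such a measure, a curator who picks a sibling or parent of the gold standard term receives partial credit rather than a flat mismatch, which is the kind of graded comparison the abstract describes.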


Bibliographic details

Journal: Database: The Journal of Biological Databases and Curation
Authors
Wasila Dahdul; Prashanti Manda; Hong Cui; James P Balhoff; T Alexander Dececchi; Nizar Ibrahim; Hilmar Lapp; Todd Vision; Paula M Mabee
Year (volume), issue: 2018 (2018), -1
Year: 2018
Pages: bay110
Total pages: 17
Original format: PDF
Chinese Library Classification: Biology

