首页> 外文期刊>Journal of Bioinformatics and Computational Biology >RAPID PATTERN DEVELOPMENT FOR CONCEPT RECOGNITION SYSTEMS: APPLICATION TO POINT MUTATIONS
【24h】

RAPID PATTERN DEVELOPMENT FOR CONCEPT RECOGNITION SYSTEMS: APPLICATION TO POINT MUTATIONS

机译:概念识别系统的快速模式开发:应用于点突变

获取原文
获取原文并翻译 | 示例
           

摘要

The primary biomedical literature is being generated at an unprecedented rate, and researchers cannot keep abreast of new developments in their fields. Biomedical natural language processing is being developed to address this issue, but building reliable systems often requires many expert-hours. We present an approach for automatically developing collections of regular expressions to drive high-performance concept recognition systems with minimal human interaction. We applied our approach to develop MutationFinder, a system for automatically extracting mentions of point mutations from the text. MutationFinder achieves performance equivalent to or better than manually developed mutation recognition systems, but the generation of its 759 patterns has required only 5.5 expert-hours. We also discuss the development and evaluation of our recently published high-quality, human-annotated gold standard corpus, which contains 1,515 complete point mutation mentions annotated in 813 abstracts.
机译:主要的生物医学文献正在以空前的速度产生,研究人员无法跟上其领域的新发展。正在开发生物医学自然语言处理来解决此问题,但是构建可靠的系统通常需要许多专家时间。我们提出了一种自动开发正则表达式集合的方法,从而以最少的人机交互驱动高性能的概念识别系统。我们使用我们的方法来开发MutationFinder,这是一种自动从文本中提取点突变提及的系统。 MutationFinder的性能相当于或优于手动开发的突变识别系统,但是其759种模式的生成仅需要5.5专家小时。我们还将讨论我们最近发布的高质量的,带有人工注释的金标准语料库的开发和评估,该标准语料库包含813个摘要中提到的1,515个完整的点突变。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号