首页> 外文会议>International conference on neural information processing >Metabolite Named Entity Recognition: A Hybrid Approach
【24h】

Metabolite Named Entity Recognition: A Hybrid Approach

机译:代谢物命名实体识别:一种混合方法

获取原文

摘要

Since labor intensive and time consuming issue, manual cura-tion in metabolic information extraction currently was replaced by text mining (TM). While TM in metabolic domain has been attempted previously, it is still challenging due to variety of specific terms and their meanings in different contexts. Named Entity Recognition (NER) generally used to identify interested keyword (protein and metabolite terms) in sentence, this preliminary task therefore highly influences the performance of metabolic TM framework. Conditional Random Fields (CRFs) NER has been actively used during a last decade, because it explicitly outperforms other approaches. However, an efficient CRFs-based NER depends purely on a quality of corpus which is a nontrivial task to produce. This paper introduced a hybrid solution which combines CRFs-based NER, dictionary usage, and complementary modules (constructed from existing corpus) in order to improve the performance of metabolic NER and another similar domain.
机译:由于劳动密集和费时的问题,目前在代谢信息提取中的手动管理已被文本挖掘(TM)取代。尽管先前已经尝试过在代谢域中使用TM,但是由于各种特定术语及其在不同情况下的含义,它仍然具有挑战性。命名实体识别(NER)通常用于识别句子中感兴趣的关键字(蛋白质和代谢物术语),因此,这项初步任务极大地影响了代谢TM框架的性能。在过去的十年中,条件随机场(CRF)NER已被积极使用,因为它明显胜过其他方法。但是,基于CRF的高效NER完全取决于语料库的质量,这是一项重要任务。本文介绍了一种混合解决方案,该解决方案结合了基于CRF的NER,字典用法和互补模块(由现有语料库构成),以提高代谢NER和另一个类似域的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号