【24h】

Extraction of Gene/Protein Interaction from Text Documents with Relation Kernel

机译:从具有关联内核的文本文档中提取基因/蛋白质相互作用

获取原文
获取原文并翻译 | 示例

摘要

Even though there are many databases for gene/protein interactions, most such data still exist only in the biomedical literature. They are spread in biomedical literature written in natural languages and they require much effort such as data mining for constructing well-structured data forms. As genomic research advances, knowledge discovery from a large collection of scientific papers is becoming more important for efficient biological and biomedical researches. In this paper, we present a relation kernel based interaction extraction method to resolve this problem. We extract gene/protein interactions of Yeast (S.cerevisiae) from text documents with relation kernel. Kernel for relation extraction is constructed with predefined interaction corpus and set of interaction patterns. Proposed relation kernel for interaction extraction only exploits shallow parsed documents. Experimental results show that the proposed kernel method achieves a recall rate of 78.3% and precision rate of 79.9% for gene/protein interaction extraction without full parsing efforts.
机译:即使有许多基因/蛋白质相互作用的数据库,大多数这样的数据仍然仅存在于生物医学文献中。它们散布在以自然语言编写的生物医学文献中,并且需要大量工作,例如用于构建结构良好的数据形式的数据挖掘。随着基因组研究的发展,从大量科学论文中发现知识对于有效的生物学和生物医学研究变得越来越重要。在本文中,我们提出了一种基于关系内核的交互提取方法来解决该问题。我们从具有相关内核的文本文档中提取酵母(S.cerevisiae)的基因/蛋白质相互作用。使用预定义的交互语料库和一组交互模式构造用于关系提取的内核。提议的用于交互提取的关系内核仅利用浅层分析的文档。实验结果表明,所提出的核方法在不进行充分解析的情况下,实现了基因/蛋白质相互作用提取的查全率达78.3%,准确率达79.9%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号