Not all links are equal: Exploiting Dependency Types for the Extraction of Protein-Protein Interactions from Text

机译：并非所有链接都是等：利用依赖类型，以提取来自文本的蛋白质 - 蛋白质相互作用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The extraction of protein-protein interactions (PPIs) reported in scientific publications is one of the most studied topics in Text Mining in the Life Sciences, as such algorithms can substantially decrease the effort for databases curators. The currently best methods for this task are based on analyzing the dependency tree (DT) representation of sentences. Many approaches exploit only topological features and thus do not yet fully exploit the information contained in DTs. We show that incorporating the grammatical information encoded in the types of the dependencies in DTs noticeably improves extraction performance by using a pattern matching approach. We automatically infer a large set of linguistic patterns using only information about interacting proteins. Patterns are then refined based on shallow linguistic features and the semantics of dependency types. Together, these lead to a total improvement of 17.2 percent points in F_1, as evaluated on five publicly available PPI corpora. More than half of that improvement is gained by properly handling dependency types. Our method provides a general framework for building task-specific relationship extraction methods that do not require annotated training data. Furthermore, our observations offer methods to improve upon relation extraction approaches.

机译：在科学出版物报道的蛋白质 - 蛋白质相互作用（生产者价格指数）的提取是在文本挖掘生命科学研究最多的话题之一，因为这种算法可以大大降低对数据库策展人的努力。此任务的最佳当前方法是基于分析句子的依赖关系树（DT）表示。许多方法只利用拓扑特征，因此并没有充分利用载有酒瘾的信息。我们发现，在纳入该类型DTS中依赖的编码语法信息明显使用模式匹配方法提高萃取性能。我们使用约相互作用的蛋白质只有信息自动推断出一大套的语言模式。那么模式是基于浅层语言特征和依赖类型的语义细化。总之，这些导致的17.2％点F_1共改善，在五个公开可用的PPI语料进行评估。超过半数的改善是通过正确处理依赖型上涨。我们的方法为构建不需要注释的训练数据的特定任务的关系提取方法的总体框架。此外，我们的观察提供了方法，提高在关系抽取方法。

著录项

来源
《Workshop on biomedical natural language processing》|2011年||共9页
会议地点
作者
Philippe Thomas; Stefan Pietschmann; Illes Solt; Domonkos Tikk; Ulf Leser;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. 蛋白质相互作用的分析:利用酵母两性杂交系统探索蛋白质功能 [J] . 中国林学（英文版） . 2002,第001期
2. Efficient Extraction of Protein-Protein Interactions from Full-Text Articles [J] . Hakenberg Jörg, Leaman Robert, Ha Vo Nguyen, Computational Biology and Bioinformatics, IEEE/ACM Transactions on . 2010,第3期

机译：从全文文章中高效提取蛋白质与蛋白质的相互作用
3. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text [J] . Martin Krallinger, Miguel Vazquez, Florian Leitner, BMC Bioinformatics . 2011,第SUPPLEMENTa8期

机译：BioCreative III的蛋白质-蛋白质相互作用任务：文章的分类/排名以及将生物本体学概念链接到全文
4. Development and Exploitation of Photo-Crosslinking Methodology to Study Protein-Protein Interactions [J] . Wilson Andrew Journal of peptide science: An official publication of the European Peptide Society . 2018,第S2期

机译：光交联方法研究蛋白质 - 蛋白质相互作用的开发与利用
5. Not all links are equal: Exploiting Dependency Types for the Extraction of Protein-Protein Interactions from Text [C] . Philippe Thomas, Stefan Pietschmann, Illes Solt, Workshop on biomedical natural language processing 2011. . 2011

机译：并非所有链接都是相等的：利用依赖类型从文本中提取蛋白质-蛋白质相互作用
6. The Making and Breaking of Lipids: The Characterization of the Protein-Protein and Protein-Substrate Interactions and Bioengineering of Type II Fatty Acid Synthases FabA and AcpP and the Structural and Functional Characterization of Iterative Type I Trans-Acting Enoyl-Reductase, LovC Polyketide Synthase. [D] . Nguyen, Chi Hanh Thi. 2014

机译：脂质的产生和断裂：II型脂肪酸合酶FabA和AcpP的蛋白质-蛋白质和蛋白质-底物相互作用的表征和生物工程以及I型迭代反式烯丙基还原酶，LovC聚酮合酶的结构和功能表征。
7. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text [O] . Martin Krallinger, Miguel Vazquez, Florian Leitner, 2011

机译：BioCreative III的蛋白质-蛋白质相互作用任务：文章的分类/排名以及将生物本体学概念链接到全文
8. Analysis of link grammar on biomedical dependency corpus targeted at protein-protein interactions [O] . Sampo Pyysalo, Filip Ginter, Tapio Pahikkala, 2004

机译：针对蛋白质-蛋白质相互作用的生物医学依赖语料库的链接语法分析
9. Novel Protein-Protein Interactions of the Yersinia pestis Type III secretion System Elucidated With a Matrix Analysis by Surface Plasmon Resonance and Mass Spectrometry [R] . Swietnicki, W. , O'Brien, S. , Holman, K. , 2004

机译：通过表面等离子体共振和质谱法进行基质分析阐明的鼠疫耶尔森氏菌III型分泌系统的新蛋白质 - 蛋白质相互作用

Not all links are equal: Exploiting Dependency Types for the Extraction of Protein-Protein Interactions from Text

摘要

著录项

相似文献

相关主题

期刊订阅