Analysis of Protein/Protein Interactions Through Biomedical Literature: Text Mining of Abstracts vs. Text Mining of Full Text Articles

机译：通过生物医学文献分析蛋白质/蛋白质相互作用：摘要的文本挖掘与全文文章的文本挖掘

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The challenge of knowledge management in the pharmaceutical industry is twofold. First it has to address the integration of sequence data with the vast and growing body of data from functional analysis of genes with the information in huge historical archival databases. Second, as the number of bio-medical publications exponentially increases (Medline now contains more than 13 million records), researchers require assistance in order to broaden their vision and comprehension of scientific domains. Analogous to data mining in the sense that it uncovers relationships in information, text mining uncovers relationships in a text collection and leverages the creativity of the knowledge worker in the exploration of these relationships and in the discovery of new knowledge. We describe herein a text mining method to automatically detect protein interactions which are described across a large amount of scientific publications. This method relies on natural language processing to identify protein names, their synonyms and the various interactions they can bear with other proteins. We have then compared text mining analysis on abstracts to the same kind of analysis on full text articles to assess how much information is lost when only abstracts are processed. Our results show that: 1)LexiQuest Mine is a very versatile and accurate tool when mining biomedical literature to analyze interactions between proteins. 2)Mining only abstracts can be sufficient and time saving for applications that do not require a high level of detail on a large scale whereas mining full text articles is to be chosen for more exhaustive applications designed to address a specific issue. Availability: LexiQuest Mine is available for commercial licensing from SPSS, Inc.

机译：制药行业知识管理的挑战是双重的。首先，它必须解决序列数据与来自基因功能分析的庞大且不断增长的数据集成问题，以及庞大的历史档案数据库中的信息。其次，随着生物医学出版物数量成倍增加（Medline现在包含超过1300万条记录），研究人员需要帮助以拓宽视野和理解科学领域。从数据挖掘揭示信息中的关系的意义上讲，类似于数据挖掘，文本挖掘在文本集合中揭示关系，并在探索这些关系和发现新知识时利用知识工作者的创造力。我们在本文中描述了一种文本挖掘方法，该方法可自动检测蛋白质相互作用，这在大量科学出版物中都有描述。这种方法依靠自然语言处理来识别蛋白质名称，它们的同义词以及它们与其他蛋白质的各种相互作用。然后，我们将对摘要的文本挖掘分析与对全文文章的同类分析进行了比较，以评估仅处理摘要时丢失了多少信息。我们的结果表明：1）LexiQuest Mine在挖掘生物医学文献以分析蛋白质之间的相互作用时是一种非常通用且准确的工具。 2）对于不需要大量详细信息的应用程序，仅挖掘摘要就足够了，并且可以节省时间，而对于那些旨在解决特定问题的更详尽的应用程序，则应选择挖掘全文文章。可用性：LexiQuest矿可从SPSS，Inc.获得商业许可。

著录项

来源
《International Symposium on Knowledge Exploration in Life Science Informatics(KELSI 2004); 20041125-26; Milan(IT)》|2004年|P.96-108|共13页
会议地点 Milan(IT)
作者
Eric P.G. Martin; Eric G. Bremer; Marie-Claude Guerin; Catherine DeSesa; Olivier Jouve;
展开▼
作者单位

SPSS, Tour Europlazza, La Defense 4, F-92925 Paris-la-Defense Cedex, France;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. How to link ontologies and protein–protein interactions to literature: text-mining approaches and the BioCreative experience [J] . Alfonso Valencia, Andrew Chatr-aryamontri, Christophe Marcelle, Database . 2012,第40期

机译：如何将本体论和蛋白质之间的相互作用与文献联系：文本挖掘方法和BioCreative经验
2. A comprehensive and quantitative comparison of text-mining in 15 million full-text articles versus their corresponding abstracts [J] . David Westergaard, Hans-Henrik St?rfeldt, Christian T?nsberg, PLoS Computational Biology . 2018,第2期

机译：对1500万篇全文文章中的文本挖掘与相应摘要进行全面，定量的比较
3. A Framework of Protein-Drug Association for Malaria by Text Data Mining of Biomedical Literature. [J] . E. KADIVAR, Kh. RAHIMI, M. A. SHAHZAMANIAN Research Journal of Pharmaceutical, Biological and Chemical Sciences . 2016,第4期

机译：生物医学文献文本数据挖掘的疟疾蛋白质药物关联框架。
4. Analysis of Protein/Protein Interactions Through Biomedical Literature: Text Mining of Abstracts vs. Text Mining of Full Text Articles [C] . Eric P.G. Martin, Eric G. Bremer, Marie-Claude Guerin, International Symposium on Knowledge Exploration in Life Science Informatics(KELSI 2004) . 2004

机译：通过生物医学文献分析蛋白质/蛋白质相互作用：摘要文本挖掘与文本挖掘全文文章
5. Using text mining to extract gene and protein synonyms from biomedical texts [D] . Duong, Duc C. 2007

机译：使用文本挖掘从生物医学文本中提取基因和蛋白质同义词
6. Analysis of Protein Phosphorylation and Its Functional Impact on Protein-Protein Interactions via Text Mining of the Scientific Literature [O] . Qinghua Wang, Karen E Ross, Hongzhan Huang, -1

机译：通过科学文献的文本挖掘分析蛋白质的磷酸化及其对蛋白质与蛋白质相互作用的功能影响
7. Analysis of Text Mining from Full-text Articles and Abstracts by Postgraduates Students in Selected Nigeria Universities [O] . Mariam Taiwo Ibrahim, Adeyinka Tella 2020

机译：选定尼日利亚大学学生的全文文章与摘要文本挖掘分析
8. Text Mining the Biomedical Literature. [R] . Kostoff, R. N. 2007

机译：文本挖掘生物医学文献。

Analysis of Protein/Protein Interactions Through Biomedical Literature: Text Mining of Abstracts vs. Text Mining of Full Text Articles

摘要

著录项

相似文献

相关主题

期刊订阅