Efficiently Querying Protein Sequences with the Proteinus Index



Abstract

Finding similarities in protein sequences is a core problem in bioinformatics. It represents the first step in the functional characterization of novel protein sequences, and is also employed in protein evolution studies and for predicting biological structure. In this paper, we propose Proteinus, a new index aimed at similarity search of protein sequences. Proteinus is characterized by using a reduced amino acid alphabet to represent protein sequences and also by providing a persistent storage of the index on disk, as well as by allowing the execution of range queries. Performance tests with real-world protein sequences showed that the Proteinus index was very efficient. Compared with the BLASTP tool, Proteinus provided an impressive performance gain from 45% up to 93% for range query processing.
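
The two ideas named in the abstract, a reduced amino acid alphabet and range queries, can be illustrated with a minimal sketch. This is not the Proteinus implementation. The abstract does not say which reduced alphabet, distance function, or index structure the authors use, so the six-class grouping, the use of Levenshtein distance, and the linear scan over an in-memory dictionary (standing in for the disk-based index) are all assumptions made for illustration. The names `REDUCED_GROUPS`, `reduce_sequence`, `edit_distance`, and `range_query` are hypothetical.

```python
# Hypothetical 6-class reduced alphabet (assumption: grouping by broad
# physicochemical similarity; the paper's actual alphabet is not given here).
REDUCED_GROUPS = {
    "h": "AVLIMC",  # hydrophobic / aliphatic
    "a": "FWY",     # aromatic
    "p": "STNQ",    # polar, uncharged
    "b": "KRH",     # basic
    "c": "DE",      # acidic
    "s": "GP",      # small / conformationally special
}

# Invert the grouping into a residue -> class-symbol lookup table.
RESIDUE_TO_CLASS = {
    residue: symbol
    for symbol, residues in REDUCED_GROUPS.items()
    for residue in residues
}


def reduce_sequence(sequence):
    """Rewrite a protein sequence in the reduced alphabet.

    Residues outside the 20 standard amino acids map to 'x'.
    """
    return "".join(RESIDUE_TO_CLASS.get(aa, "x") for aa in sequence.upper())


def edit_distance(a, b):
    """Levenshtein distance, computed row by row (assumed metric)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def range_query(database, query, radius):
    """Return (id, distance) pairs whose reduced-alphabet distance to the
    query is at most `radius`, sorted by distance and then by id.

    A linear scan is used here; an index would avoid comparing the query
    against every stored sequence.
    """
    reduced_query = reduce_sequence(query)
    hits = []
    for seq_id, sequence in database.items():
        distance = edit_distance(reduced_query, reduce_sequence(sequence))
        if distance <= radius:
            hits.append((seq_id, distance))
    return sorted(hits, key=lambda hit: (hit[1], hit[0]))


if __name__ == "__main__":
    toy_database = {"p1": "MKVLA", "p2": "MRILA", "p3": "DDEEGG"}
    print(range_query(toy_database, "MKVLA", radius=1))
```

In this toy database, `MKVLA` and `MRILA` differ in two residues but collapse to the same reduced string, so both fall within a radius of 1. That is the general appeal of a reduced alphabet: conservative substitutions stop counting as differences. How Proteinus exploits this inside its index is not described in the abstract.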


Bibliographic details

Source
Advances in Bioinformatics and Computational Biology, 2011, pp. 58-65 (8 pages)
Conference location: Brasilia (BR)
Authors
Felipe Alves da Louza; Ricardo Rodrigues Ciferri; Cristina Dutra de Aguiar Ciferri
Author affiliations

Department of Computer Science, University of Sao Paulo, 13.560-970, Sao Carlos, SP, Brasil;

Department of Computer Science, Federal University of Sao Carlos, 13.565-905, Sao Carlos, SP, Brasil;

Department of Computer Science, University of Sao Paulo, 13.560-970, Sao Carlos, SP, Brasil;

Original format: PDF
Language: English
Chinese Library Classification: Bioengineering (biotechnology)
Keywords
protein sequences; similarity search; index structure;


Similar literature

1. Proteinus crenulatus Pandellé (Staphylinidae) new to Ireland with a comment on separation from other Proteinus [J]. Roy Anderson. The Coleopterist, 2014, Issue Pta3.
2. PSSARD (2.0): A database server for making flexible queries relating amino acid sequences to main-chain secondary structure conformations for proteins of known three-dimensional structure and certain useful applications [J]. Sridhar S, Babu AVN, Guruprasad K. International Journal of Biological Macromolecules: Structure, Function and Interactions, 2007, No. 1.
3. Two New Species of the Genus Proteinus from Japan (Coleoptera: Staphylinidae: Proteininae) [J]. Yasuhiko Hayashi. Japanese Journal of Systematic Entomology, 2017, No. 2.
4. Efficiently Querying Protein Sequences with the Proteinus Index [C]. Felipe Alves da Louza, Ricardo Rodrigues Ciferri, and Cristina Dutra de Aguiar Ciferri. Brazilian Symposium on Bioinformatics, 2011.
5. Efficient implementation of update and retrieval query sequences over large data sets in a native XML database [D]. Mikhaylov, Alexander. 2006.
6. Raptor: A fast and space-efficient pre-filter for querying very large collections of nucleotide sequences [O]. Enrico Seiler, Svenja Mehringer, Mitra Darvish, 2021
7. Raptor: A fast and space-efficient pre-filter for querying very large collections of nucleotide sequences [O]. Enrico Seiler, Svenja Mehringer, Mitra Darvish, 2020
