首页> 外文会议>IEEE International Conference on Bioinformatics and Bioengineering >Towards Centralized MS/MS Spectra Preprocessing: An Empirical Evaluation of Peptides Search Engines using Ground Truth Datasets
【24h】

Towards Centralized MS/MS Spectra Preprocessing: An Empirical Evaluation of Peptides Search Engines using Ground Truth Datasets

机译:用于集中的MS / MS Spectra预处理:使用地面真理数据集的肽搜索引擎的实证评估

获取原文

摘要

several peptides search engines have been developed in the recent decades. Most of the time and for the same inputs, different search enginesa€? result in different peptides were identified, which can confuse the stakeholders in the field of proteomics. The massive amount of generated spectra by high throughput spectrometers adds another challenge which handicaps the current search engines. This motivates the researchers to evaluate the combination of several search engines. Several studies provided ensemble solutions over shared and distributed computing environments for reliable results. However, the massive amount of MS/MS spectra is a cumbersome traffic over the systemsa€? networks. This issue directly impacts the searching performance and also adds unnecessary extra costs (computing, storage, network traffic) if cloud cluster is being used. The main question of this paper is: Can we build a central MS/MS spectra preprocessing for semantically different protein search engines? We evaluate different statistical reduction techniques using four popular protein search engines. In order to fairly evaluate the results, we build ground truth unanimous-based datasets for two different species; yeast and human. Our techniques result in significant peak reduction, where only around 30% of the spectra peaks are enough to report reliable identifications from the used search engines in this study.
机译:几十年来,几种肽搜索引擎已经发展起来。大多数时间和同一投入,不同的搜索招€?鉴定出不同肽的结果,这可以混淆蛋白质组学领域的利益攸关方。高吞吐量光谱仪的大量产生的光谱增加了涉及当前搜索引擎的另一个挑战。这使研究人员能够评估几个搜索引擎的组合。几项研究提供了共享和分布式计算环境的集合解决方案,以获得可靠的结果。但是,大量的MS / MS光谱是系统的繁琐交通兑换?网络。如果正在使用云群集,此问题直接影响搜索性能,并增加了不必要的额外成本(计算,存储,网络流量)。本文的主要问题是:我们可以为语义不同的蛋白质搜索引擎建立一个中央MS / MS Spectra预处理吗?我们使用四个流行的蛋白质搜索引擎评估不同的统计减少技术。为了公平评估结果,我们为两种不同物种构建了基于实际的基于数据集;酵母和人类。我们的技术导致显着的峰值降低,其中距离频谱峰的大约30°峰值足以从本研究中的使用中的搜索引擎报告可靠的标识。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号