Studying the effect of parallelization on the performance of Andromeda Search Engine: A search engine for peptides.


Abstract

The human body is made of proteins, and analyzing the structure and function of these proteins reveals important information about it. Mass spectrometry is an important technique for protein evaluation. The data generated by a mass spectrometer are analyzed to detect patterns in proteins. A wide variety of operations are performed on these data, namely visualization, spectral deconvolution, peak alignment, normalization, pattern recognition, and significance testing. A number of software packages analyze the large volume of data that a mass spectrometer generates. One example is MaxQuant, which analyzes high-resolution mass spectrometric data. A search engine called Andromeda is integrated into MaxQuant and is used for peptide identification.

One major drawback of the Andromeda search engine is its execution time. Peptide identification involves a number of complex operations and intensive data processing. This research therefore focuses on parallelization as a way to improve the performance of the Andromeda search engine. The data are partitioned and distributed across multiple cores and nodes, and multiple tasks are executed concurrently on those nodes and cores.

A number of bioinformatics applications have been parallelized with a significant improvement in execution time over their serial versions. In this work, Task Parallel Library (TPL) and Common Language Runtime (CLR) constructs are used to parallelize the application. The aim is to apply these techniques to the Andromeda search engine and reduce its execution time by leveraging multi-core architectures.
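The partition-and-distribute idea in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: the thesis uses .NET TPL and CLR constructs, whereas this analogue uses Python's standard `concurrent.futures`. The names `partition`, `count_matches`, `score_chunk`, and `score_parallel` are hypothetical, and the peak-counting "score" is a toy stand-in, not Andromeda's scoring function.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(items, n_parts):
    """Split items into n_parts contiguous chunks whose sizes differ by at most one."""
    size, extra = divmod(len(items), n_parts)
    chunks, start = [], 0
    for i in range(n_parts):
        end = start + size + (1 if i < extra else 0)
        chunks.append(items[start:end])
        start = end
    return chunks


def count_matches(spectrum, fragments, tol=0.01):
    """Toy score (an assumption): number of peaks within tol of any fragment mass."""
    return sum(1 for peak in spectrum if any(abs(peak - f) <= tol for f in fragments))


def score_chunk(chunk, fragments):
    """Score every spectrum in one partition; this is the unit of work per worker."""
    return [count_matches(spectrum, fragments) for spectrum in chunk]


def score_parallel(spectra, fragments, n_parts=4, executor_cls=ThreadPoolExecutor):
    """Partition the spectra, score the partitions concurrently, keep input order."""
    chunks = partition(spectra, n_parts)
    with executor_cls(max_workers=n_parts) as pool:
        # map() yields results in submission order, so scores line up with spectra.
        results = pool.map(score_chunk, chunks, [fragments] * len(chunks))
        return [score for chunk_scores in results for score in chunk_scores]
```

A thread pool is the default here only to keep the sketch portable. For CPU-bound scoring in Python, passing `executor_cls=ProcessPoolExecutor` (from the same module) would be the closer counterpart to spreading work across cores.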

Bibliographic record

  • Author: Shah, Jigna
  • Author affiliation: Purdue University
  • Degree-granting institution: Purdue University
  • Discipline: Computer science
  • Degree: M.S.
  • Year: 2015
  • Pages: 64 p.
  • Original format: PDF
  • Language of text: English