...
首页> 外文期刊>Pattern recognition letters >CUDA-MEME: Accelerating motif discovery in biological sequences using CUDA-enabled graphics processing units
【24h】

CUDA-MEME: Accelerating motif discovery in biological sequences using CUDA-enabled graphics processing units

机译:CUDA-MEME:使用支持CUDA的图形处理单元加速生物序列中的基序发现

获取原文
获取原文并翻译 | 示例
           

摘要

Motif discovery in biological sequences is of prime importance and a major challenge in computational biology. Consequently, numerous motif discovery tools have been developed to date. However, the rapid growth of both genomic sequence and gene transcription data, establishes the need for the development of scalable motif discovery tools. An approach to improve the runtime of motif discovery by an order-of-magnitude without losing sensitivity is to employ emerging many-core architectures such as CUDA-enabled CPUs. In this paper, we present a highly parallel formulation and implementation of the MEME motif discovery algorithm using the CUDA programming model. To achieve high efficiency, we introduce two parallelization approaches: sequence-level and substring-level parallelization. Furthermore, a hybrid computing framework is described to take advantage of both CPU and GPU compute resources. Our performance evaluation on a GeForce GTX 280 GPU, results in average runtime speedups of 21.4 (19.3) for the starting point search and 20.5 (16.4) for the overall runtime using the OOPS (ZOOPS) motif search model. The runtime speedups of CUDA-MEME on a single GPU are also comparable to those of ParaMEME running on 16 CPU cores of a high-performance workstation cluster. In addition to the fast speed, CUDA-MEME has the capability of finding motif instances consistent with the sequential MEME.
机译:在生物序列中发现基序是最重要的,也是计算生物学中的主要挑战。因此,迄今为止已经开发了许多主题发现工具。但是,基因组序列和基因转录数据的快速增长,建立了可扩展的基序发现工具的开发需求。在不损失敏感度的情况下,以一个数量级的方式改善主题发现的运行时间的方法是采用新兴的多核体系结构,例如支持CUDA的CPU。在本文中,我们介绍了使用CUDA编程模型的MEME主题发现算法的高度并行表示和实现。为了实现高效率,我们引入了两种并行化方法:序列级并行化和子字符串级并行化。此外,描述了一种混合计算框架,以利用CPU和GPU计算资源。我们对GeForce GTX 280 GPU的性能评估得出,使用OOPS(ZOOPS)主题搜索模型,起点搜索的平均运行时加速比为21.4(19.3),整体运行时的平均运行时加速比为20.5(16.4)。在单个GPU上CUDA-MEME的运行时加速也可与在高性能工作站集群的16个CPU内核上运行的ParaMEME相比。除了快速以外,CUDA-MEME还具有查找与顺序MEME一致的基序实例的功能。

著录项

  • 来源
    《Pattern recognition letters》 |2010年第14期|P.2170-2177|共8页
  • 作者单位

    School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore;

    rnSchool of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore;

    rnSchool of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore;

    rnSchool of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    CUDA; MEME; GPU; motif discovery;

    机译:CUDA;MEME;GPU;发现动机;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号