...
首页> 外文期刊>Parallel Computing >Parallelizing and optimizing a bioinformatics pairwise sequence alignment algorithm for many-core architecture
【24h】

Parallelizing and optimizing a bioinformatics pairwise sequence alignment algorithm for many-core architecture

机译:用于多核架构的生物信息学成对序列比对算法的并行化和优化

获取原文
获取原文并翻译 | 示例
           

摘要

Current computer engineering evolves at an accelerated pace, with hardware advancing towards new chip multiprocessors (CMP) architectures and with supporting software gearing towards new programming and abstraction paradigms, to obtain the maximum efficiency of the hardware at a low cost. In this context, Tilera Corporation has developed a brand new CMP architecture with 64 cores (tiles) called Tile64, and has launched several Peripheral Component Interconnect Express (PCIe) cards to be used and monitored from a host Personal Computer (PC). These cards may execute parallel applications built in C/ C++ and compiled with the Tile-GCC compiler. We have previously demonstrated the usefulness of the Tile64 architecture for bioinformatics [S. Galvez, D. Diaz, P. Hernandez, F.J. Esteban, J.A. Caballero, G. Dorado, Next-generation bioinformatics: using many-core processor architecture to develop a web service for sequence alignment, Bioinformatics, 26 (2010) 683-686]. We have chosen a bioinformatics algorithm to test this many-core Tile64 architecture because of actual bioinformatics challenging needs: data-intensive workloads, space and time-consuming requirements and massive calculation. This algorithm, known as Needleman-Wunsch/Smith-Waterman (NW/SW), obtains an optimal sequence alignment in quadratic time and space cost, yet requires to be optimized to take full advantage of computing parallelization. In this paper we redesign, implement and fine-tune this algorithm, introducing key optimizations and changes that take advantage of specific Tile64 characteristics: RISC architecture, local tile's cache, length of memory word, shared memory usage, RAM file system, tile's intercommunication and job selection from a pool. The resulting algorithm - named MC64-NW/SW for Multicore64 Needleman-Wunsch/Smith-Waterman - achieves a gain of ~1000% when compared with the same algorithm on a x86 multi-core architecture. As far as we know, our NW/SW implementation is the fastest ever published for a standalone PC when aligning a pair of sequences larger than 20 kb.
机译:当前的计算机工程技术发展日新月异,硬件向着新的芯片多处理器(CMP)架构发展,支持软件向着新的编程和抽象范例发展,从而以低成本获得了最大的硬件效率。在这种情况下,Tilera公司开发了一种全新的CMP体系结构,该体系结构具有称为Tile64的64个内核(平铺),并推出了数个外围组件互连Express(PCIe)卡,可从主机个人计算机(PC)使用和监视这些卡。这些卡可以执行C / C ++内置并由Tile-GCC编译器编译的并行应用程序。我们之前已经证明了Tile64体系结构对生物信息学的有用性。加尔维斯·D·迪亚兹·P·埃尔南德斯·F·J·埃斯特万·J.A。 Caballero,G。Dorado,“下一代生物信息学:使用多核处理器体系结构开发用于序列比对的Web服务,Bioinformatics,26(2010)683-686]。由于实际的生物信息学挑战性需求:数据密集型工作负载,空间和耗时的需求以及大量计算,我们选择了一种生物信息学算法来测试这种多核Tile64架构。这种称为Needleman-Wunsch / Smith-Waterman(NW / SW)的算法以二次时间和空间成本获得了最佳的序列比对,但仍需要进行优化以充分利用计算并行化的优势。在本文中,我们将重新设计,实现和微调该算法,并介绍利用特定Tile64特性的关键优化和更改:RISC体系结构,本地tile的高速缓存,内存字的长度,共享内存的使用,RAM文件系统,tile的互通和从池中选择工作。与x86多核体系结构上的相同算法相比,所产生的算法-用于Multicore64 Needleman-Wunsch / Smith-Waterman的名为MC64-NW / SW的算法可实现〜1000%的增益。据我们所知,当对齐一对大于20 kb的序列时,我们的NW / SW实现是有史以来最快的独立PC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号