首页> 美国卫生研究院文献>Bioinformatics and Biology Insights >Analysis of Parallel Algorithms on SMP Node and Cluster of Workstations Using Parallel Programming Models with New Tile-based Method for Large Biological Datasets
【2h】

Analysis of Parallel Algorithms on SMP Node and Cluster of Workstations Using Parallel Programming Models with New Tile-based Method for Large Biological Datasets

机译:大型生物数据集基于新图块的并行编程模型对SMP节点和工作站集群的并行算法进行分析

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Sequence alignment is an important tool for describing the relationships between DNA sequences. Many sequence alignment algorithms exist, differing in efficiency, in their models of the sequences, and in the relationship between sequences. The focus of this study is to obtain an optimal alignment between two sequences of biological data, particularly DNA sequences. The algorithm is discussed with particular emphasis on time, speedup, and efficiency optimizations. Parallel programming presents a number of critical challenges to application developers. Today’s supercomputer often consists of clusters of SMP nodes. Programming paradigms such as OpenMP and MPI are used to write parallel codes for such architectures. However, the OpenMP programs cannot be scaled for more than a single SMP node. However, programs written in MPI can have more than single SMP nodes. But such a programming paradigm has an overhead of internode communication. In this work, we explore the tradeoffs between using OpenMP and MPI. We demonstrate that the communication overhead incurs significantly even in OpenMP loop execution and increases with the number of cores participating. We also demonstrate a communication model to approximate the overhead from communication in OpenMP loops. Our results are astonishing and interesting to a large variety of input data files. We have developed our own load balancing and cache optimization technique for message passing model. Our experimental results show that our own developed techniques give optimum performance of our parallel algorithm for various sizes of input parameter, such as sequence size and tile size, on a wide variety of multicore architectures.
机译:序列比对是描述DNA序列之间关系的重要工具。存在许多序列比对算法,其效率,序列模型和序列之间的关系不同。这项研究的重点是在生物学数据的两个序列(尤其是DNA序列)之间获得最佳比对。讨论该算法时特别强调时间,加速和效率优化。并行编程给应用程序开发人员带来了许多关键挑战。当今的超级计算机通常由SMP节点群集组成。诸如OpenMP和MPI之类的编程范例用于编写此类体系结构的并行代码。但是,OpenMP程序不能扩展到一个以上的SMP节点。但是,用MPI编写的程序可以具有多个SMP节点。但是这种编程范例具有节点间通信的开销。在这项工作中,我们探索了使用OpenMP和MPI之间的权衡。我们证明,即使在OpenMP循环执行中,通信开销也会显着增加,并且随着参与的内核数量的增加而增加。我们还演示了一种通信模型,可以估算OpenMP循环中的通信开销。对于各种各样的输入数据文件,我们的结果令人惊讶且有趣。我们已经为消息传递模型开发了自己的负载平衡和缓存优化技术。我们的实验结果表明,我们自己开发的技术可在多种多核体系结构上为各种大小的输入参数(例如序列大小和图块大小)提供并行算法的最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号