...
首页> 外文期刊>BMC Bioinformatics >DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI
【24h】

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI

机译:DecGPU:使用CUDA和MPI在大规模并行图形处理单元上进行分布式错误纠正

获取原文
           

摘要

Background Next-generation sequencing technologies have led to the high-throughput production of sequence data (reads) at low cost. However, these reads are significantly shorter and more error-prone than conventional Sanger shotgun reads. This poses a challenge for the de novo assembly in terms of assembly quality and scalability for large-scale short read datasets. Results We present DecGPU, the first parallel and distributed error correction algorithm for high-throughput short reads (HTSRs) using a hybrid combination of CUDA and MPI parallel programming models. DecGPU provides CPU-based and GPU-based versions, where the CPU-based version employs coarse-grained and fine-grained parallelism using the MPI and OpenMP parallel programming models, and the GPU-based version takes advantage of the CUDA and MPI parallel programming models and employs a hybrid CPU+GPU computing model to maximize the performance by overlapping the CPU and GPU computation. The distributed feature of our algorithm makes it feasible and flexible for the error correction of large-scale HTSR datasets. Using simulated and real datasets, our algorithm demonstrates superior performance, in terms of error correction quality and execution speed, to the existing error correction algorithms. Furthermore, when combined with Velvet and ABySS, the resulting DecGPU-Velvet and DecGPU-ABySS assemblers demonstrate the potential of our algorithm to improve de novo assembly quality for de-Bruijn-graph-based assemblers. Conclusions DecGPU is publicly available open-source software, written in CUDA C++ and MPI. The experimental results suggest that DecGPU is an effective and feasible error correction algorithm to tackle the flood of short reads produced by next-generation sequencing technologies.
机译:背景技术下一代测序技术已导致以低成本高产量地生产序列数据(读)。但是,这些读取比传统的Sanger read弹枪读取要短得多,并且更容易出错。对于从头组装,这在大规模短读数据集的组装质量和可伸缩性方面构成了挑战。结果我们展示了DecGPU,这是使用CUDA和MPI并行编程模型的混合组合的,用于高通量短读(HTSR)的第一个并行和分布式纠错算法。 DecGPU提供基于CPU和基于GPU的版本,其中基于CPU的版本使用MPI和OpenMP并行编程模型使用粗粒度和细粒度并行机制,而基于GPU的版本则利用CUDA和MPI并行编程建模并采用CPU + GPU混合计算模型,通过重叠CPU和GPU计算来最大化性能。我们算法的分布式特征使其对于大规模HTSR数据集的纠错具有可行性和灵活性。使用模拟和真实数据集,我们的算法在纠错质量和执行速度方面展示了优于现有纠错算法的性能。此外,当与Velvet和ABySS结合使用时,所得的DecGPU-Velvet和DecGPU-ABySS汇编程序证明了我们算法的潜力,可以提高基于de-Bruijn-graph的汇编程序的从头汇编质量。结论DecGPU是公开可用的开源软件,用CUDA C ++和MPI编写。实验结果表明,DecGPU是一种有效且可行的纠错算法,可解决下一代测序技术产生的大量短读问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号