首页> 外文期刊>International Journal of High Performance Computing Applications >Efficient graphic processing unit implementation of the chemical-potential multiphase lattice Boltzmann method
【24h】

Efficient graphic processing unit implementation of the chemical-potential multiphase lattice Boltzmann method

机译:化学电位多相晶格Boltzmann方法的高效图形处理单元实现

获取原文
获取原文并翻译 | 示例
           

摘要

The chemical-potential multiphase lattice Boltzmann method (CP-LBM) has the advantages of satisfying the thermodynamic consistency and Galilean invariance, and it realizes a very large density ratio and easily expresses the surface wettability. Compared with the traditional central difference scheme, the CP-LBM uses the Thomas algorithm to calculate the differences in the multiphase simulations, which significantly improves the calculation accuracy but increases the calculation complexity. In this study, we designed and implemented a parallel algorithm for the chemical-potential model on a graphic processing unit (GPU). Several strategies were used to optimize the GPU algorithm, such as coalesced access, instruction throughput, thread organization, memory access, and loop unrolling. Compared with dual-Xeon 5117 CPU server, our methods achieved 95 times speedup on an NVIDIA RTX 2080Ti GPU and 106 times speedup on an NVIDIA Tesla P100 GPU. When the algorithm was extended to the environment with dual NVIDIA Tesla P100 GPUs, 189 times speedup was achieved and the workload of each GPU reached 96%.
机译:化学电位多相晶格Boltzmann方法(CP-LBM)具有满足热力学稠度和加里利利尼依赖性的优点,并且它实现了非常大的密度比,并且容易表达表面润湿性。与传统的中央差分方案相比,CP-LBM使用托马斯算法来计算多相模拟的差异,这显着提高了计算精度,但增加了计算复杂性。在本研究中,我们在图形处理单元(GPU)上设计并实施了化学电位模型的并行算法。使用几种策略来优化GPU算法,例如聚合的访问,指令吞吐量,线程组织,内存访问和循环展开。与Dual-Xeon 5117 CPU服务器相比,我们的方法在NVIDIA RTX 2080TI GPU上实现了95倍的加速,并在NVIDIA Tesla P100 GPU上加速了106倍。当算法扩展到具有双NVIDIA TESLA P100 GPU的环境时,实现了189倍的加速,每个GPU的工作量达到96%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号