首页> 外文会议>International Conference on Information Technology and Intelligent Transportation Systems >Accelerating Reservoir Simulation on Multi-core and Many-Core Architectures with Graph Coloring ILU(k)
【24h】

Accelerating Reservoir Simulation on Multi-core and Many-Core Architectures with Graph Coloring ILU(k)

机译:用图形着色ILU(k)加速了多核和多核架构的储层模拟

获取原文

摘要

Incomplete LU (ILU) methods are widely used in petroleum reservoir simulation and many other applications. However high complexity oftenmakes them the hotspot in the whole simulation due to high complexity when problem size is large. ILU's inherent serial nature also makes them difficult to take full advantage of computing power of multi-core and many-core devices. In this paper, a greedy graph coloringmethod is applied to the ILU(k) factorization and triangular solution phases. This method increases degree of parallelism and improves load balance.Ablock-wise storage format is employed in our ILU implementation in order to take advantage of hierarchical memory structures.Moreover, a dual intensive parallelmodel is proposed to further improve the performance of ILU(k) on GPUs. We test the performance of the proposed parallel ILU(k) with a set of Jacobian systems arising from petroleum reservoir simulation. Numerical results suggest that the proposed parallel ILU(k) method is effective and robust on multi-core and many-core architectures. On an Intel Xeon E5 multi-core CPU, the speedup compared with the serial execution time is 5.6× and 5.4× for factorization and triangular solution, respectively; on an Nvidia K40c GPU card, the speedup can reach 8.6× and 12.7× for factorization and triangular solution, respectively.
机译:LU(ILU)方法广泛用于石油储层模拟和许多其他应用。然而,由于问题尺寸很大,高度复杂性通常会导致整个模拟中的热点。 ilu的固有串行性质也使得它们难以充分利用多核和许多核心设备的计算能力。在本文中,将贪婪的图形着色方法应用于ILU(K)分解和三角溶液相。该方法增加了平行度,提高了负载余额。我们在我们的ILU实现中采用了载荷存储格式,以利用分层存储器结构。提出了一种双密集的并联模型,以进一步提高ILU(k)的性能。在GPU上。我们用石油储层模拟产生的一套雅可比系统测试所提出的平行ILU(k)的性能。数值结果表明,所提出的并行ILU(K)方法对多核和多核架构有效且鲁棒。在英特尔Xeon E5多核CPU上,加速与串行执行时间相比分别为5.6×和5.4×,分别用于分解和三角形解决方案;在NVIDIA K40C GPU卡上,加速度可以分别达到8.6×和12.7×进行分解和三角形解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号