首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Refactoring and Optimizing the Community Atmosphere Model (CAM) on the Sunway TaihuLight Supercomputer
【24h】

Refactoring and Optimizing the Community Atmosphere Model (CAM) on the Sunway TaihuLight Supercomputer

机译:在Sunway TaihuLight超级计算机上重构和优化社区气氛模型(CAM)

获取原文

摘要

This paper reports our efforts on refactoring and optimizing the Community Atmosphere Model (CAM) on the Sunway TaihuLight supercomputer, which uses a many-core processor that consists of management processing elements (MPEs) and clusters of computing processing elements (CPEs). To map the large code base of CAM to the millions of cores on the Sunway system, we take OpenACC-based refactoring as the major approach, and apply source-to-source translator tools to exploit the most suitable parallelism for the CPE cluster, and to fit the intermediate variable into the limited on-chip fast buffer. For individual kernels, when comparing the original ported version using only MPEs and the refactored version using both the MPE and CPE clusters, we achieve up to 22× speedup for the compute-intensive kernels. For the 25km resolution CAM global model, we manage to scale to 24,000 MPEs, and 1,536,000 CPEs, and achieve a simulation speed of 2.81 model years per day.
机译:本文报告了我们在Sunway TaihuLight超级计算机上重构和优化社区气氛模型(CAM)的努力,该超级计算机使用由管理处理元素(MPE)和计算处理元素(CPE)集群组成的多核处理器。为了将CAM的大型代码库映射到Sunway系统上的数百万个内核,我们将基于OpenACC的重构作为主要方法,并应用源到源转换器工具为CPE集群开发最合适的并行性,并且使中间变量适合有限的片上快速缓冲区。对于单个内核,将仅使用MPE的原始移植版本与同时使用MPE和CPE集群的重构版本进行比较时,我们将计算密集型内核的速度提高了22倍。对于分辨率为25km的CAM全局模型,我们设法扩展到24,000个MPE和1,536,000个CPE,并实现了每天2.81个模型年的仿真速度。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号