Petascale Computing with Accelerators

Abstract

A trend is developing in high performance computing in which commodity processors are coupled to various types of computational accelerators. Such systems are commonly called hybrid systems. In this paper, we describe our experience developing an implementation of the Linpack benchmark for a petascale hybrid system, the LANL Roadrunner cluster built by IBM for Los Alamos National Laboratory. This system combines traditional x86-64 host processors with IBM PowerXCell 8i accelerator processors. The implementation of Linpack we developed was the first to achieve a performance result in excess of 1.0 PFLOPS, and made Roadrunner the #1 system on the Top500 list in June 2008. We describe the design and implementation of hybrid Linpack, including the special optimizations we developed for this hybrid architecture. We then present actual results for single node and multi-node executions. From this work, we conclude that it is possible to achieve high performance for certain applications on hybrid architectures when careful attention is given to efficient use of memory bandwidth, scheduling of data movement between the host and accelerator memories, and proper distribution of work between the host and accelerator processors.
