首页> 外文会议>ICA3PP 2014 >Utilizing Multiple Xeon Phi Coprocessors on One Compute Node
【24h】

Utilizing Multiple Xeon Phi Coprocessors on One Compute Node

机译:在一个计算节点上利用多个Xeon Phi共生器

获取原文

摘要

Future exascale systems are expected to adopt compute nodes that incorporate many accelerators. This paper thus investigates the topic of programming multiple Xeon Phi coprocessors that lie inside one compute node. Besides a standard MPI-OpenMP programming approach, which belongs to the symmetric usage mode, two offload-mode programming approaches are considered. The first offload approach is conventional and uses compiler pragmas, whereas the second one is new and combines Intel's APIs of coprocessor offload infrastructure (COI) and symmetric communication interface (SCIF) for low-latency communication. While the pragma-based approach allows simpler programming, the COI-SCIF approach has three advantages in (1) lower overhead associated with launching offloaded code, (2) higher data transfer bandwidths, and (3) more advanced asynchrony between computation and data movement. The low-level COI-SCIF approach is also shown to have benefits over the MPI-OpenMP counterpart. All the programming approaches are tested by a real-world 3D application, for which the COI-SCIF approach shows a performance upper hand on a Tianhe-2 compute node with three Xeon Phi coprocessors.
机译:预计未来的Exascale系统将采用包含许多加速器的计算节点。因此,本文调查了位于一个计算节点内部的多个Xeon Phi协处理器的主题。除了属于对称使用模式的标准MPI-OpenMP编程方法外,还考虑了两个卸载模式编程方法。第一个卸载方法是常规的,使用编译器Pragmas,而第二个是新的,并将英特尔的协处理器API与Coprocessor卸载基础设施(COI)和对称通信接口(SCIF)的API结合起来,以进行低延迟通信。虽然基于Pragma的方法允许更简单的编程,但COI-SCIF方法具有三个优点,其中(1)较低的开销与启动卸载代码,(2)更高的数据传输带宽,(3)计算和数据移动之间的更高级的异步。(3) 。低级别的COI-SCIF方法也显示出对MPI-OPENMP对应物的益处。所有编程方法都是由真实世界3D应用测试的,其中COI-SCIF方法在天河2计算节点上显示了具有三个Xeon Phi协处理器的天河2计算节点的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号