首页> 美国政府科技报告 >Computational Performance of Intel MIC, Sandy Bridge, and GPU Architectures: Implementation of a 1D c++/OpenMP Electrostatic Particle-In-Cell Code
【24h】

Computational Performance of Intel MIC, Sandy Bridge, and GPU Architectures: Implementation of a 1D c++/OpenMP Electrostatic Particle-In-Cell Code

机译:英特尔mIC,sandy Bridge和GpU架构的计算性能:1D c ++ / Openmp静电粒子在线代码的实现

获取原文

摘要

We present initial comparison performance results for Intel MIC, Sandy Bridge (SB), and GPU. A 1D explicit electrostatic particle-in-cell (PIC) code is used to simulate a two-stream instability in plasma. We compare the computation times for various number of cores/threads and compiler options. The parallelization is implemented via OpenMP with maximum thread number of 128. Parallelization and vectorization on the GPU is achieved with modifying the code syntax for compatibility with CUDA. We assess the speedup due to various auto-vectorization and optimization level compiler options. Our results show that the MIC is several times slower than SB for a single thread and it becomes faster than SB when the number of cores increases with vectorization switched on. The compute times for the GPU are consistently about 6-7 times faster than the ones for MIC. Compared to SB, the GPU is about 2 times faster for a single thread and about an order of magnitude faster for 128 threads. The net speedup, however, for MIC and GPU are almost the same. An initial attempt to offload parts of the code to the MIC co-processor shows that there is an optimal number of threads where the speedup reaches a maximum.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号