...
首页> 外文期刊>International Journal of High Performance Computing Applications >DESIGNING ZERO-COPY MESSAGE PASSING INTERFACE DERIVED DATATYPE COMMUNICATION OVER INFINIBAND: ALTERNATIVE APPROACHES AND PERFORMANCE EVALUATION
【24h】

DESIGNING ZERO-COPY MESSAGE PASSING INTERFACE DERIVED DATATYPE COMMUNICATION OVER INFINIBAND: ALTERNATIVE APPROACHES AND PERFORMANCE EVALUATION

机译:在无穷大上设计零拷贝消息传递通过接口的数据类型通信:替代方法和性能评估

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we present a new scheme, Send Gather Receive Scatter (SGRS), to perform zero-copy datatype communication over InfiniBand. This scheme leverages the gather/scatter feature provided by InfiniBand channel semantics. It takes advantage of the capability of processing non-contiguity on both send and receive sides in the Send Gather and Receive Scatter operations. We have implemented this new design and evaluated the performance for Message Passing Interface level point-to-point microbenchmarks and collectives, on PCI-X and upcoming high performance PCI-Express systems. In our previous work we had come up with an alternate zero-copy approach using multiple RDMA Writes (Multi-W). Compared to the existing Multi-W zero-copy datatype scheme, the SGRS scheme can overcome the drawbacks of low network utilization and high startup cost. On PCI-X platforms, our experimental results show significant improvement in both point-to-point and collective datatype communication. The latency of a vector datatype can be reduced by up to 62% and the bandwidth shows improvement up to 400% as compared with the Multi-W scheme. The Alltoall collective shows up to 23% reduction in latency. Further, the SGRS scheme shows low CPU overhead with a potential promise for better computation and communication overlap. The experimental results on PCI-Express platforms demonstrate the relevance of zero-copy protocols to overcome memory bandwidth limitations. The trends we observe in PCI-X platform are magnified on PCI-Express platforms with even higher improvement for the microbenchmarks and collectives.
机译:在本文中,我们提出了一种新方案,即发送聚集接收分散(SGRS),以通过InfiniBand执行零拷贝数据类型通信。该方案利用了InfiniBand通道语义提供的收集/分散功能。它利用“发送收集”和“接收分散”操作在发送和接收双方处理非连续性的能力。我们已经实现了这一新设计,并评估了PCI-X和即将推出的高性能PCI-Express系统上消息传递接口级别的点对点微基准和集合的性能。在我们以前的工作中,我们提出了使用多个RDMA写入(Multi-W)的另一种零复制方法。与现有的Multi-W零拷贝数据类型方案相比,SGRS方案可以克服网络利用率低和启动成本高的缺点。在PCI-X平台上,我们的实验结果表明,点对点和集体数据类型通信都得到了显着改善。与Multi-W方案相比,矢量数据类型的等待时间最多可以减少62%,带宽显示最多可以提高400%。 Alltoall集体的延迟减少了多达23%。此外,SGRS方案显示出较低的CPU开销,并有望实现更好的计算和通信重叠。 PCI-Express平台上的实验结果证明了零拷贝协议克服内存带宽限制的相关性。我们在PCI-X平台上观察到的趋势在PCI-Express平台上得到了放大,对微型基准和集合的改进甚至更高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号