首页> 外文期刊>Computers & Digital Techniques, IET >Intensive computing on a large data volume with a short-vector single instruction multiple data processor
【24h】

Intensive computing on a large data volume with a short-vector single instruction multiple data processor

机译:使用短向量单指令多数据处理器对大数据量进行密集计算

获取原文
获取原文并翻译 | 示例
           

摘要

In this study, the authors want to evaluate the performances of the PowerXCell 8i processor, which is based on Cell Broadband Engine architecture. For this purpose, the authors chose an algorithm for the k-nearest neighbour problem. The authors optimised this algorithm for efficient exploitation of the facilities provided by this architecture. The authors evaluated the PowerXCell 8i performances by algorithm execution with single- and double-precision calculations. For both cases, the performances were evaluated with and without SIMDisation. For single-precision calculations, the authors achieved a maximum speed-up of 43.85 with SIMDisation by activating 6 synergetic processor element (SPE) processors and 39.73 without SIMDisation by activating 16 SPE processors. For double-precision calculations, the authors achieved a maximum speed-up of 34.79 with SIMDisation by activating 9 SPE processors and 32.71 without SIMDisation by activating 12 SPE processors. These values related to the execution on the PowerPC processor element processor and are due to the accessing way of the main memory by the SPE cores, through the DMA transfers who are performed in parallel with the computing operations. The authors conclude that this process can be efficiently used for the execution of algorithms that require intensive computations on huge data volume.
机译:在这项研究中,作者希望评估PowerXCell 8i处理器的性能,该处理器基于Cell Broadband Engine体系结构。为此,作者选择了k最近邻问题的算法。作者优化了此算法,以有效利用此体系结构提供的功能。作者使用单精度和双精度计算通过算法执行来评估PowerXCell 8i的性能。对于这两种情况,在有和没有SIMDisation的情况下都对性能进行了评估。对于单精度计算,作者通过激活6个协同处理器元件(SPE)处理器,使SIMD化达到了43.85的最大加速速度;而通过激活16个SPE处理器,使SIMD实现了39.73的最大加速速度。对于双精度计算,作者通过激活9个SPE处理器实现了SIMD化的最大加速比为34.79,而通过激活12个SPE处理器实现了没有SIMDisation的最大加速比为32.71。这些值与PowerPC处理器元素处理器上的执行有关,并且归因于SPE内核通过与计算操作并行执行的DMA传输对主存储器的访问方式。作者得出的结论是,该过程可以有效地用于执行需要对大量数据进行大量计算的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号