Intensive computing on a large data volume with a short-vector single instruction multiple data processor

Ungurean I.; Gaitan V.-G.; Gaitan N.-C.

首页> 外文期刊>Computers & Digital Techniques, IET >Intensive computing on a large data volume with a short-vector single instruction multiple data processor

【24h】

Intensive computing on a large data volume with a short-vector single instruction multiple data processor

机译：使用短向量单指令多数据处理器对大数据量进行密集计算

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, the authors want to evaluate the performances of the PowerXCell 8i processor, which is based on Cell Broadband Engine architecture. For this purpose, the authors chose an algorithm for the k-nearest neighbour problem. The authors optimised this algorithm for efficient exploitation of the facilities provided by this architecture. The authors evaluated the PowerXCell 8i performances by algorithm execution with single- and double-precision calculations. For both cases, the performances were evaluated with and without SIMDisation. For single-precision calculations, the authors achieved a maximum speed-up of 43.85 with SIMDisation by activating 6 synergetic processor element (SPE) processors and 39.73 without SIMDisation by activating 16 SPE processors. For double-precision calculations, the authors achieved a maximum speed-up of 34.79 with SIMDisation by activating 9 SPE processors and 32.71 without SIMDisation by activating 12 SPE processors. These values related to the execution on the PowerPC processor element processor and are due to the accessing way of the main memory by the SPE cores, through the DMA transfers who are performed in parallel with the computing operations. The authors conclude that this process can be efficiently used for the execution of algorithms that require intensive computations on huge data volume.

机译：在这项研究中，作者希望评估PowerXCell 8i处理器的性能，该处理器基于Cell Broadband Engine体系结构。为此，作者选择了k最近邻问题的算法。作者优化了此算法，以有效利用此体系结构提供的功能。作者使用单精度和双精度计算通过算法执行来评估PowerXCell 8i的性能。对于这两种情况，在有和没有SIMDisation的情况下都对性能进行了评估。对于单精度计算，作者通过激活6个协同处理器元件（SPE）处理器，使SIMD化达到了43.85的最大加速速度；而通过激活16个SPE处理器，使SIMD实现了39.73的最大加速速度。对于双精度计算，作者通过激活9个SPE处理器实现了SIMD化的最大加速比为34.79，而通过激活12个SPE处理器实现了没有SIMDisation的最大加速比为32.71。这些值与PowerPC处理器元素处理器上的执行有关，并且归因于SPE内核通过与计算操作并行执行的DMA传输对主存储器的访问方式。作者得出的结论是，该过程可以有效地用于执行需要对大量数据进行大量计算的算法。

著录项

来源
《Computers & Digital Techniques, IET》 |2014年第5期|219-228|共10页
作者
Ungurean I.; Gaitan V.-G.; Gaitan N.-C.;
展开▼
作者单位

Stefan cel Mare University of Suceava, Faculty of Electrical Engineering and Computer Science, Romania;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Demonstration and architectural analysis of complementary metal-oxide semiconductor/multiple-quantum-well smart-pixel array cellular logic processors for single-instruction multiple-data parallel-pipeline processing [J] . Jen-Ming Wu, Charles B. Kuznia, Bogdan Hoanca Applied optics . 1999,第11期

机译：用于单指令多数据并行流水线处理的互补金属氧化物半导体/多量子阱智能像素阵列细胞逻辑处理器的演示和体系结构分析
2. Single instruction multiple data code auto generation for a very long instruction words digital signal processor in sensor-based systems [J] . Xu Yang, Yanjun Zhang, Dake Liu, Wireless Sensor Systems, IET . 2013,第2期

机译：基于传感器的系统中的超长指令字数字信号处理器的单指令多数据代码自动生成
3. Towards building a data-intensive index for big data computing - A case study of Remote Sensing data processing [J] . Ma Yan, Wang Lizhe, Liu Peng, Information Sciences: An International Journal . 2015,第Null期

机译：致力于为大数据计算建立数据密集型索引-遥感数据处理案例研究
4. Business intelligence: Self adapting and prioritizing database algorithm for providing big data insight in domain knowledge and processing of volume based instructions based on scheduled and contextual shifting of data [C] . Mazhar Hameed, Usman Qamar, Usman Akram 2016 Future Technologies Conference . 2016

机译：商业智能：自适应和优先级高的数据库算法，可提供基于领域的知识的大数据洞察力，并基于数据的调度和上下文转移提供基于卷的指令处理
5. Integrated Management of the Persistent-Storage and Data-Processing Layers in Data-Intensive Computing Systems. [D] . Borisov, Nedyalko. 2012

机译：数据密集型计算系统中持久性存储和数据处理层的集成管理。
6. Granular computing with multiple granular layers for brain big data processing [O] . Guoyin Wang, Ji Xu 2014

机译：具有多个粒度层的粒度计算用于大脑大数据处理
7. Demonstration and architectural analysis of complementary metal-oxide semiconductor/multiple-quantum-well smart-pixel array cellular logic processors for single-instruction multiple-data parallel-pipeline processing [O] . Wu JM 2012

机译：用于单指令多数据并行流水线处理的互补金属氧化物半导体/多量子阱智能像素阵列细胞逻辑处理器的演示和体系结构分析
8. Distributed Computing for Signal Processing: Modeling of Asynchronous Parallel Computation. Appendix D. Analysis of MIMD (Multiple Instruction Streams, Multiple Data Streams) Algorithms: Features, Measurements, and Results [R] . Smith, K. D. 1984

机译：信号处理的分布式计算：异步并行计算的建模。附录D. mImD（多指令流，多数据流）算法的分析：特征，测量和结果

Intensive computing on a large data volume with a short-vector single instruction multiple data processor

摘要

著录项

相似文献

相关主题

期刊订阅