
SLP-Oriented Vectorization of Nested Loops (面向SLP的多重循环向量化), 《软件学报》 (Journal of Software)

Abstract

More and more processors now integrate SIMD (single instruction multiple data) extensions, and most compilers implement automatic vectorization, but vectorization is usually applied only to the innermost loop; there is no general, easy-to-apply method for vectorizing a whole loop nest. This paper proposes an SLP (superword level parallelism) oriented vectorization method for nested loops. The method analyzes the loop nest from the outermost level inward and, for each loop level, first determines through dependence analysis whether direct unroll-and-jam is legal, then collects attributes that affect vectorization performance, including whether direct unroll-and-jam can be applied, the number of array references that are contiguous with respect to that loop's index, and the region covered by the loop. Based on these attributes, it decides at which loop levels to perform unroll-and-jam, and finally vectorizes the statements in the resulting loop body with the SLP algorithm. Experimental results on an Intel platform for a set of numerical, video, and communication kernels show that this approach achieves average speedups of 2.13 over innermost-loop vectorization and 1.41 over simple outer-loop vectorization, and reaches speedups of up to 5.3 on some common core loops.
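A minimal sketch of the transformation the abstract describes, on a hypothetical kernel whose arrays are contiguous along the outer index i (so innermost-loop vectorization would need strided accesses): the outer loop is unroll-and-jammed by a factor of 4, and the four isomorphic statements left in the inner-loop body are then packed into one SIMD operation, as an SLP pass would. The kernel, the array sizes, and the use of SSE intrinsics are illustrative assumptions, not taken from the paper.

#include <stdio.h>
#include <xmmintrin.h>   /* SSE intrinsics for the hand-packed version */

#define N 8   /* hypothetical sizes; N assumed to be a multiple of 4 */
#define M 8

static float a[M][N], b[M][N], c[M][N];

/* Original loop nest: the arrays are contiguous in the outer index i,
 * so vectorizing the inner j loop directly would need strided loads. */
static void scalar_kernel(void) {
    for (int i = 0; i < N; i++)
        for (int j = 0; j < M; j++)
            a[j][i] = b[j][i] * c[j][i];
}

/* Step 1: unroll-and-jam the outer i loop by 4. The four copies of
 * the body inside the j loop are isomorphic and touch adjacent
 * memory, which is exactly the pattern SLP looks for. */
static void unroll_and_jam_kernel(void) {
    for (int i = 0; i < N; i += 4)
        for (int j = 0; j < M; j++) {
            a[j][i + 0] = b[j][i + 0] * c[j][i + 0];
            a[j][i + 1] = b[j][i + 1] * c[j][i + 1];
            a[j][i + 2] = b[j][i + 2] * c[j][i + 2];
            a[j][i + 3] = b[j][i + 3] * c[j][i + 3];
        }
}

/* Step 2: pack the four isomorphic statements into one SIMD multiply,
 * as an SLP pass would do after unroll-and-jam. */
static void slp_kernel(void) {
    for (int i = 0; i < N; i += 4)
        for (int j = 0; j < M; j++) {
            __m128 vb = _mm_loadu_ps(&b[j][i]);
            __m128 vc = _mm_loadu_ps(&c[j][i]);
            _mm_storeu_ps(&a[j][i], _mm_mul_ps(vb, vc));
        }
}

int main(void) {
    for (int j = 0; j < M; j++)
        for (int i = 0; i < N; i++) {
            b[j][i] = (float)(i + j);
            c[j][i] = 2.0f;
        }
    scalar_kernel();
    float ref = a[3][5];            /* (5 + 3) * 2 = 16.0 */
    unroll_and_jam_kernel();
    slp_kernel();
    printf("scalar = %.1f, slp = %.1f\n", ref, a[3][5]);
    return 0;
}

All three variants compute the same result; the point of the sketch is only to show where the contiguous SIMD loads come from once the outer loop, rather than the inner one, supplies the parallelism.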
