...
首页> 外文期刊>Concurrency, practice and experience >OpenCL performance portability for general-purpose computation on graphics processor units: an exploration on cryptographic primitives
【24h】

OpenCL performance portability for general-purpose computation on graphics processor units: an exploration on cryptographic primitives

机译:用于图形处理器单元上通用计算的OpenCL性能可移植性:对密码原语的探索

获取原文
获取原文并翻译 | 示例
           

摘要

The modern trend toward heterogeneous many-core architectures has led to high architectural diversity inrnboth high performance and high-end embedded systems. To effectively exploit the computational resourcesrnof such a wide range of architectures, programming languages and APIs such as OpenCL have becomernincreasingly popular. Although OpenCL provides functional code portability and the ability to fine tunernthe application to the target hardware, providing performance portability is still an open problem. Thus,rnmany research works have investigated the optimization of specific combinations of application and targetrnplatform. In this paper, we aim at leveraging the experience obtained in the implementation of algorithmsrnfrom the cryptography domain to provide a set of guidelines for modern many-core heterogeneous architecturernperformance portability and to establish a base on which domain-specific languages and compilerrntransformations could be built in the near future. We study algorithmic choices and the effect of compilerrntransformations on three representative applications in the chosen domain on a set of seven target platforms.rnTo estimate how well the application fits the architecture, we define a metric of computational intensity bothrnfor the architecture and the application implementation. Besides being useful to compare either differentrnimplementation or algorithmic choices and their fitness to a specific architecture, it can also be useful to therncompiler to guide the code optimization process.
机译:异构多核架构的现代趋势已导致高性能和高端嵌入式系统之间的高度架构多样性。为了有效地利用计算资源,在如此广泛的体系结构中,诸如OpenCL之类的编程语言和API变得越来越流行。尽管OpenCL提供了功能代码的可移植性以及将应用程序微调到目标硬件的能力,但是提供性能可移植性仍然是一个未解决的问题。因此,许多研究工作已经研究了应用程序和目标平台的特定组合的优化。在本文中,我们旨在利用从密码学领域实施算法获得的经验,为现代多核异构体系结构的性能可移植性提供一套指南,并为构建特定于领域的语言和编译器转换奠定基础。不久的将来。我们研究算法的选择以及编译器转换对一组七个目标平台上选定域中三个代表性应用程序的影响。为了估计应用程序对体系结构的适应程度,我们定义了体系结构和应用程序实现的计算强度度量。除了可以比较不同的实现或算法选择及其对特定体系结构的适用性之外,它对于编译器指导代码优化过程也很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号