Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators Based on a Domain-Specific Language for Medical Imaging

机译：基于用于医学成像的域特定语言，自动优化GPU加速器的飞行内存交易

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

An efficient memory bandwidth utilization for GPU accelerators is crucial for memory bound applications. In medical imaging, the performance of many kernels is limited by the available memory bandwidth since only a few operations are performed per pixel. For such kernels only a fraction of the compute power provided by GPU accelerators can be exploited and performance is predetermined by memory bandwidth. As a remedy, this paper investigates the optimal utilization of available memory bandwidth by means of increasing in-flight memory transactions. Instead of doing this manually for different GPU accelerators, the required CUDA and OpenCL code is automatically generated from descriptions in a Domain-Specific Language (DSL) for the considered application domain. Moreover, the DSL is extended to also support global reduction operators. We show that the generated target-specific code improves bandwidth utilization for memory-bound kernels significantly. Moreover, competitive performance compared to the GPU back end of the widely used image processing library OpenCV can be achieved.

机译：GPU加速器的有效内存带宽利用率对于内存绑定应用是至关重要的。在医学成像中，许多内核的性能受到可用的存储器带宽的限制，因为每像素仅执行少数操作。对于这种内核，只能利用GPU加速器提供的计算功率的一小部分，并且通过内存带宽预定性能。作为补救措施，本文通过增加飞行中的内存事务来调查可用内存带宽的最佳利用。而不是为不同的GPU加速器手动执行此操作，而是从所考虑的应用程序域中的域特定语言（DSL）的描述中自动生成所需的CUDA和OpenCL代码。此外，DSL扩展到还支持全局还原运营商。我们表明生成的目标特定代码显着提高了内存绑定内核的带宽利用率。此外，与广泛使用的图像处理库OpenCV的GPU后端相比，可以实现竞争性能。

著录项

来源
《International Symposium on Parallel and Distributed Computing》|2012年||共8页
会议地点
作者
Membarth Richard; Hannig Frank; Teich Jurgen; Korner Mario; Eckert Wieland;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP316.4-53;
关键词

相似文献

外文文献
中文文献
专利

1. Automatic generation of Truffle-based interpreters for Domain-Specific Languages [J] . Manuel Leduc, Gwendal Jouneaux, Thomas Degueule, The Journal of object technology . 2020,第2期

机译：自动生成域特定语言的基于Truffle的解释器
2. Automatic Semantic Indexing Of Medical Images Using A Web Ontologylanguage For Case-based Image Retrieval [J] . Gowri Allampalli-Nagaraj, Isabelle Bichindaritz Engineering Applications of Artificial Intelligence . 2009,第1期

机译：使用Web本体语言进行基于案例的图像检索的医学图像自动语义索引
3. Towards High-Performance Code Generation for Multi-GPU Clusters Based on a Domain-Specific Language for Algorithmic Skeletons [J] . Fabian Wrede, Herbert Kuchen International journal of parallel programming . 2020,第4期

机译：基于算法骨架的域特定语言，对多GPU集群的高性能代码生成
4. Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators Based on a Domain-Specific Language for Medical Imaging [C] . Membarth Richard, Hannig Frank, Teich Jurgen, 2012 11th International Symposium on Parallel and Distributed Computing. . 2012

机译：基于领域特定语言的医学成像，用于GPU加速器的飞行中内存事务自动优化
5. Language support and compiler optimizations for object-based software transactional memory. [D] . Eddon, Guy. 2008

机译：基于对象的软件事务存储器的语言支持和编译器优化。
6. DOPA: GPU-based protein alignment using database and memory access optimizations [O] . Laiq Hasan, Marijn Kentie, Zaid Al-Ars 2011

机译：DOPA：使用数据库和内存访问优化的基于GPU的蛋白质比对
7. Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators based on a Domain-Specific Language for Medical Imaging [O] . Richard Membarth, Frank Hannig, Jürgen Teich, 2014

机译：基于医学成像领域特定语言的GPU加速器的机内内存事务自动优化

Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators Based on a Domain-Specific Language for Medical Imaging

摘要

著录项

相似文献

相关主题

期刊订阅