【24h】

Bacon: A GPU Programming System With Just in Time Specialization

机译:培根:具有即时专业性的GPU编程系统

获取原文

摘要

This paper describes Bacon, a data-parallel programming system targeting OpenCL-compatible graphics processors. This system is built upon the existing OpenCL standard in order to make it easier for programmers to write high performance kernels for GPU accelerated applications. The OpenCL C syntax is extended into a new language, Bacon C, intended to make development significantly more convenient and enabling pre-optimizations based on just-in-time specialization as this code is compiled via OpenCL at runtime.Benchmarks are provided for matrix multiplication comparing two Bacon implementations to similar OpenCL implementations. Speedups are demonstrated both for naive implementations and when comparing a Bacon implementation of generalized block decomposed matrix multiplication to a hand-vectorized OpenCL kernel. This latter result demonstrates the benefit of the total loop unrolling enabled by just-in-time specialization.
机译:本文介绍了Bacon,这是一种针对与OpenCL兼容的图形处理器的数据并行编程系统。该系统基于现有的OpenCL标准构建,以使程序员更容易为GPU加速的应用程序编写高性能内核。 OpenCL C语法已扩展为一种新语言Bacon C,该语言旨在显着简化开发过程,并在运行时通过OpenCL编译该代码时基于即时专业化实现预优化。为矩阵乘法提供了基准将两个Bacon实现与类似的OpenCL实现进行比较。在针对朴素的实现以及将广义块分解矩阵乘法的Bacon实现与手动矢量化的OpenCL内核进行比较时,都展示了加速效果。后一个结果证明了即时专业化可以实现总循环展开的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号