首页> 外文会议>Federated Conference on Computer Science and Information Systems >Parallel cache-efficient code for computing the McCaskill partition functions
【24h】

Parallel cache-efficient code for computing the McCaskill partition functions

机译:用于计算McCaskill分区函数的并行高速缓存有效代码

获取原文

摘要

We present parallel tiled optimized McCaskill’s partition functions computation code. That CPU and memory intensive dynamic programming task is within computational biology. To optimize code, we use the authorial source-to-source TRACO compiler and compare obtained code performance to that generated with the state-of-the-art PluTo compiler based on the affine transformations framework (ATF). Although PLuTo generates tiled code with outstanding locality, it fails to parallelize tiled code. A TRACO tiling strategy uses the transitive closure of a dependence graph to avoid affine function calculation. The ISL scheduler is used to parallelize tiled loop nests. An experimental study carried out on a multi-core computer demonstrates considerable speed-up of generated code for the larger number of threads.
机译:我们展示了并行平铺优化的McCaskill的分区函数计算代码。 CPU和内存密集型动态编程任务属于计算生物学。为了优化代码,我们使用了权威的源到源TRACO编译器,并将获得的代码性能与使用基于仿射变换框架(ATF)的最新PluTo编译器生成的代码性能进行比较。尽管PLuTo生成具有出色局部性的切片代码,但是它无法并行化切片代码。 TRACO切片策略使用依赖图的可传递闭合来避免仿射函数的计算。 ISL调度程序用于并行化切片循环嵌套。在多核计算机上进行的一项实验研究表明,针对大量线程,所生成代码的速度大大提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号