首页> 外文会议>2015 International Conference on Parallel Architecture and Compilation >Parallel Methods for Verifying the Consistency of Weakly-Ordered Architectures
【24h】

Parallel Methods for Verifying the Consistency of Weakly-Ordered Architectures

机译:验证弱序体系结构一致性的并行方法

获取原文
获取原文并翻译 | 示例

摘要

Contemporary microprocessors use relaxed memory consistency models to allow for aggressive optimizations in hardware. This enhancement in performance comes at the cost of design complexity and verification effort. In particular, verifying an execution of a program against its system's memory consistency model is an NP-complete problem. Several graph-based approximations to this problem based on carefully constructed randomized test programs have been proposed in the literature, however, such approaches are sequential and execute slowly on large graphs of interest. Unfortunately, the ability to execute larger tests is tremendously important, since such tests enable one to expose bugs more quickly. Successfully executing more tests per unit time is also desirable, since it allows for one to check for a greater variety of errors in the memory subsystem by utilizing a more diverse set of tests. This paper improves upon existing work by introducing an algorithm that not only reduces the time complexity of the verification process, but also facilitates the development of parallel algorithms for solving these problems. We first show performance improvements from a sequential approach and gain further performance from parallel implementations in OpenMP and CUDA. For large tests of interest, our GPU implementation achieves an average application speedup of 26.36x over existing techniques in use at NVIDIA.
机译:当代的微处理器使用宽松的内存一致性模型来允许对硬件进行积极的优化。性能的提高是以设计复杂性和验证工作为代价的。特别是,对照其系统的内存一致性模型验证程序的执行是一个NP完全问题。在文献中已经提出了一些基于精心设计的随机测试程序的基于图的近似方法,但是这种方法是顺序的,并且在感兴趣的大图上执行缓慢。不幸的是,执行大型测试的能力非常重要,因为这样的测试可以使人们更快地发现错误。还希望每单位时间成功执行更多的测试,因为它允许通过利用一组更多样化的测试来检查内存子系统中错误的种类。本文通过引入一种算法来改进现有工作,该算法不仅可以减少验证过程的时间复杂度,而且可以促进解决这些问题的并行算法的开发。我们首先展示了顺序方法的性能改进,并通过OpenMP和CUDA中的并行实现获得了进一步的性能。对于感兴趣的大型测试,我们的GPU实施比NVIDIA使用的现有技术实现了平均26.36倍的应用程序加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号