首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >The Sharing Tracker: Using Ideas from Cache Coherence Hardware to Reduce Off-Chip Memory Traffic with Non-Coherent Caches
【24h】

The Sharing Tracker: Using Ideas from Cache Coherence Hardware to Reduce Off-Chip Memory Traffic with Non-Coherent Caches

机译:共享跟踪器:使用缓存一致性硬件中的想法来减少非一致性缓存的片外内存流量

获取原文

摘要

Graphics Processing Units (GPUs) have recently emerged as a new platform for high performance, general-purpose computing. Because current GPUs employ deep multithreading to hide latency, they only have small, per-core caches to capture reuse and eliminate unnecessary off-chip accesses. This paper shows that for general-purpose workloads, the ability to copy cache lines between private caches captures inter-core temporal locality and provides substantial reductions in off-chip bandwidth requirements. Unlike hardware cache coherence, a sharing tracker only needs to track cache lines in the private caches imprecisely, because it is only a performance hint. This simplifies the implementation and is so effective at capturing inter-core reuse that the L2 can be eliminated entirely. The sharing tracker is motivated by but not specific to the GPU and could be used in other manycore organizations.
机译:图形处理单元(GPU)最近已经成为高性能,通用计算的新平台。由于当前的GPU使用深多线程来隐藏延迟,因此它们仅具有小的每核缓存来捕获重用并消除不必要的片外访问。本文表明,对于通用工作负载,在专用缓存之间复制缓存行的功能可捕获内核间的时间局部性,并大幅降低片外带宽需求。与硬件缓存一致性不同,共享跟踪器仅需要精确地跟踪专用缓存中的缓存行,因为这只是性能提示。这简化了实现,并且在捕获内核间的重用方面非常有效,因此可以完全消除L2。共享跟踪器受GPU激励但并非特定于GPU,并且可以在其他许多核心组织中使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号