首页> 外文会议>Annual International Symposium on Computer Architecture >Stash: Have your scratchpad and cache it too
【24h】

Stash: Have your scratchpad and cache it too

机译:Stash:有你的刮刀和缓存它

获取原文

摘要

Heterogeneous systems employ specialization for energy efficiency. Since data movement is expected to be a dominant consumer of energy, these systems employ specialized memories (e.g., scratchpads and FIFOs) for better efficiency for targeted data. These memory structures, however, tend to exist in local address spaces, incurring significant performance and energy penalties due to inefficient data movement between the global and private spaces. We propose an efficient heterogeneous memory system where specialized memory components are tightly coupled in a unified and coherent address space. This paper applies these ideas to a system with CPUs and GPUs with scratchpads and caches. We introduce a new memory organization, stash, that combines the benefits of caches and scratchpads without incurring their downsides. Like a scratchpad, the stash is directly addressed (without tags and TLB accesses) and provides compact storage. Like a cache, the stash is globally addressable and visible, providing implicit data movement and increased data reuse. We show that the stash provides better performance and energy than a cache and a scratchpad, while enabling new use cases for heterogeneous systems. For 4 microbenchmarks, which exploit new use cases (e.g., reuse across GPU compute kernels), compared to scratchpads and caches, the stash reduces execution cycles by an average of 27% and 13% respectively and energy by an average of 53% and 35%. For 7 current GPU applications, which are not designed to exploit the new features of the stash, compared to scratchpads and caches, the stash reduces cycles by 10% and 12% on average (max 22% and 31%) respectively, and energy by 16% and 32% on average (max 30% and 51%).
机译:异构系统采用能效专业化。由于数据移动预计是能量的主要消费者,因此这些系统采用专门的存储器(例如,Scratchpads和FIFO),以更好的目标数据效率。然而,这些存储器结构往往存在于局部地址空间中,由于全局和私人空间之间的低效数据移动而导致的显着性能和能量惩罚。我们提出了一种高效的异构内存系统,其中专用存储器组件在统一和相干的地址空间中紧密耦合。本文将这些想法应用于带有CPU和GPU的系统,带有Scratchpads和Caches。我们介绍了一个新的内存组织,藏匿,它结合了缓存和刮板的好处,而不会导致他们的缺点。与Scratchpad一样,即可直接寻址删除(无需标签和TLB访问)并提供紧凑的存储空间。与缓存一样,藏匿处是全局寻址和可见的,提供隐式数据移动和增加的数据重用。我们表明Stash提供了比缓存和Scratchpad更好的性能和能量,同时为异构系统提供新的使用情况。对于4微机器,与ScratchPads和高速缓存相比,利用新的使用情况(例如,GPU计算内核中的重用),分别将执行循环平均降低27%和13%,而能量平均为53%和35 %。对于7个目前的GPU应用,它不设计用于利用藏匿处的新功能,与Scratchpads和高速缓存相比,分别将周期降低10%和12%(最高22%和31%)和能量平均16%和32%(最多30%和51%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号