Locality Aware Memory Assignment and Tiling

机译：临时意识到内存分配和平铺

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the trend toward specialization, an efficient memory-path design is vital to capitalize customization in data-path. A monolithic memory hierarchy is often highly inefficient for irregular applications, traditionally targeted for CPUs. New approaches and tools are required to offer application-specific memory customization combining the benefits of cache and scratchpad memory simultaneously. This paper introduces a novel approach for automated application-specific on-chip memory assignment and tiling. The approach offers two major tools: (1) static memory access analysis and (2) variable-level memory assignment. Static memory analysis performs at the LLVM abstraction. It extracts target-independent pointer behaviors, measures the access strides and analyze the prefetchability of variables. (2) variable-level memory assignment creates a memory allocation graph for memory assignment (cache vs. scratchpad) based on the variables size and their estimated locality. It also explores the opportunity for tiling memory access. For the exploration and results, this paper uses Machsuite benchmarks (with both regular & irregular memory access behaviors), and gem5-Aladdin tool for performance & power evaluation. The proposed approach optimizes the memory hierarchy by automatically combining the benefits of cache, (tiled-) scratchpad at variable level granularity per individual applications. The results demonstrate more than 45% improvement in our power-stall product, on average, over the monolithic cache or scratchpad design.

机译：朝向专业化的趋势，一个高效的存储器路径设计是至关重要的利用在数据通路定制。单片存储器层次往往是不规则的应用，传统的针对CPU的效率非常低。新的方法和工具都需要专用内存提供定制同时结合缓存和暂存存储器的好处。本文介绍了用于特定应用的自动片上存储器分配和平铺的新方法。该方法提供了两个主要的工具：（1）静态存储器访问分析和（2）可变级别的内存分配。在LLVM抽象静态存储器分析进行。它提取目标无关的指针行为，测量访问步幅和分析的变量prefetchability。（2）可变级存储器分配创建基于变量的大小和其估计为局部性存储器分配的存储器分配图（缓存与暂存器）。它还探讨了碎片存储器访问的机会。勘探与结果，本文采用Machsuite基准（两者经常与不规则的内存访问行为），以及性能和电源评测gem5 - 阿拉丁工具。所提出的方法通过在每个单独的应用可变级粒度自动地合并的高速缓存，（tiled-）暂存器的好处优化存储器层级。结果表明超过45％的改善我们的力量失速产品，平均而言，在整体高速缓存或暂存器的设计。

著录项

来源
《ACM/ESDA/IEEE Design Automation Conference》|2018年|515-1076p|共6页
会议地点
作者
Samuel Rogers; Hamed Tabkhi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP2-53;
关键词

相似文献

外文文献
中文文献
专利

1. A Spatial and Temporal Locality-Aware Adaptive Cache Design With Network Optimization for Tiled Many-Core Architectures [J] . Mingyu Wang, Zhaolin Li IEEE transactions on very large scale integration (VLSI) systems . 2017,第9期

机译：面向网络的平铺多核体系结构的时空局部性自适应缓存设计与网络优化
2. A spill data aware memory assignment technique for improving power consumption of multimedia memory systems [J] . Youn Jonghee, Cho Doosan Multimedia Tools and Applications . 2019,第5期

机译：溢出数据感知内存分配技术，用于提高多媒体存储系统的功耗
3. DeLoc: A Locality and Memory-Congestion-Aware Task Mapping Method for Modern NUMA Systems [J] . Agung Mulya, Amrizal Muhammad Alfian, Egawa Ryusuke, Quality Control, Transactions . 2020,第期

机译：DELOC：现代NUMA系统的局部性和内存 - 拥塞式任务映射方法
4. Locality Aware Memory Assignment and Tiling [C] . Samuel Rogers, Hamed Tabkhi 2018 55th ACM/ESDA/IEEE Design Automation Conference . 2018

机译：位置感知的内存分配和切片
5. Distributed Query Processing Over Incomplete, Sampled, and Locality-Aware Data [D] . Sundarmurthy, Bruhathi. 2018

机译：对不完整，采样和位置感知的数据进行分布式查询处理
6. Awareness and attitude towards dehydration and its management amongst mothers and factors influence on in under-five children of Omdurman locality Sudan [O] . Hiba M. A. Mohamed, Faiza S. M. Mohammed 2020

机译：苏丹欧洲曼务人口局域网中偏离脱水及其管理的意识及其管理
7. Locality-aware Connection Management and Rank Assignment for Wide-area MPI [O] . Hideo Saito et al. 2007

机译：广域MPI的区域感知连接管理和等级分配
8. Memory Reference Locality and Periodic Relocation in Main Memory Search Trees [R] . Oksanen, K., Malmi, L. 1995

机译：主存储器搜索树中的存储器参考位置和周期重定位

Locality Aware Memory Assignment and Tiling

摘要

著录项

相似文献

相关主题

期刊订阅