Locality Aware Memory Assignment and Tiling

机译：临时意识到内存分配和平铺

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the trend toward specialization, an efficient memory-path design is vital to capitalize customization in data-path. A monolithic memory hierarchy is often highly inefficient for irregular applications, traditionally targeted for CPUs. New approaches and tools are required to offer application-specific memory customization combining the benefits of cache and scratchpad memory simultaneously.This paper introduces a novel approach for automated application-specific on-chip memory assignment and tiling. The approach offers two major tools: (1) static memory access analysis and (2) variable-level memory assignment. Static memory analysis performs at the LLVM abstraction. It extracts target-independent pointer behaviors, measures the access strides and analyze the prefetchability of variables. (2) variable-level memory assignment creates a memory allocation graph for memory assignment (cache vs. scratchpad) based on the variables size and their estimated locality. It also explores the opportunity for tiling memory access. For the exploration and results, this paper uses Machsuite benchmarks (with both regular & irregular memory access behaviors), and gem5-Aladdin tool for performance & power evaluation. The proposed approach optimizes the memory hierarchy by automatically combining the benefits of cache, (tiled-) scratchpad at variable level granularity per individual applications. The results demonstrate more than 45% improvement in our power-stall product, on average, over the monolithic cache or scratchpad design.

机译：凭借专业化的趋势，高效的内存路径设计对于将自定义进行资本化数据路径至关重要。单片内存层次结构通常对不规则应用的效率高，传统上针对CPU。新方法和工具都需要提供特定于应用程序的内存自定义，同时组合缓存和刮板内存的好处。本文介绍了一种自动应用程序特定的片上存储器分配和平铺的新方法。该方法提供了两个主要工具：（1）静态内存访问分析和（2）可变级内存分配。静态存储器分析在LLVM抽象中执行。它提取目标独立的指针行为，测量访问进程并分析变量的可预取性。（2）可变级别存储器分配基于变量大小及其估计的位置创建用于存储器分配（缓存与临时局）的内存分配图。它还探讨了平铺内存访问的机会。对于勘探和结果，本文使用Machsuite基准（具有常规和不规则内存访问行为）和Gem5-Aladdin工具进行性能和功率评估。该方法通过自动将缓存（Tiled-）Scratchpad的优势自动组合每个单独的应用程序的可变级别粒度的优势来优化内存层次结构。结果展示了我们的功率 - 摊位产品，平均在整体高速缓存或刮板设计上有超过45 ％。

著录项

来源
《ACM/ESDA/IEEE Design Automation Conference》|2018年|514p|共6页
会议地点
作者
Samuel Rogers; Hamed Tabkhi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术及设备;
关键词
Indexes; Tools; Memory management; Optimization; System-on-chip; Prefetching; Resource management;

机译：索引;工具;内存管理;优化;片上系统;预取;资源管理;

相似文献

外文文献
中文文献
专利

1. A Spatial and Temporal Locality-Aware Adaptive Cache Design With Network Optimization for Tiled Many-Core Architectures [J] . Mingyu Wang, Zhaolin Li IEEE transactions on very large scale integration (VLSI) systems . 2017,第9期

机译：面向网络的平铺多核体系结构的时空局部性自适应缓存设计与网络优化
2. A spill data aware memory assignment technique for improving power consumption of multimedia memory systems [J] . Youn Jonghee, Cho Doosan Multimedia Tools and Applications . 2019,第5期

机译：溢出数据感知内存分配技术，用于提高多媒体存储系统的功耗
3. DeLoc: A Locality and Memory-Congestion-Aware Task Mapping Method for Modern NUMA Systems [J] . Agung Mulya, Amrizal Muhammad Alfian, Egawa Ryusuke, Quality Control, Transactions . 2020,第期

机译：DELOC：现代NUMA系统的局部性和内存 - 拥塞式任务映射方法
4. Locality Aware Memory Assignment and Tiling [C] . Samuel Rogers, Hamed Tabkhi 2018 55th ACM/ESDA/IEEE Design Automation Conference . 2018

机译：位置感知的内存分配和切片
5. Distributed Query Processing Over Incomplete, Sampled, and Locality-Aware Data [D] . Sundarmurthy, Bruhathi. 2018

机译：对不完整，采样和位置感知的数据进行分布式查询处理
6. Awareness and attitude towards dehydration and its management amongst mothers and factors influence on in under-five children of Omdurman locality Sudan [O] . Hiba M. A. Mohamed, Faiza S. M. Mohammed 2020

机译：苏丹欧洲曼务人口局域网中偏离脱水及其管理的意识及其管理
7. Locality-aware Connection Management and Rank Assignment for Wide-area MPI [O] . Hideo Saito et al. 2007

机译：广域MPI的区域感知连接管理和等级分配
8. Memory Reference Locality and Periodic Relocation in Main Memory Search Trees [R] . Oksanen, K., Malmi, L. 1995

机译：主存储器搜索树中的存储器参考位置和周期重定位

Locality Aware Memory Assignment and Tiling

摘要

著录项

相似文献

相关主题

期刊订阅