首页> 外文会议>2019 Spring Simulation Conference >Gem5-X: A Gem5-Based System Level Simulation Framework to Optimize Many-Core Platforms
【24h】

Gem5-X: A Gem5-Based System Level Simulation Framework to Optimize Many-Core Platforms

机译:Gem5-X:基于Gem5的系统级仿真框架,用于优化多核平台

获取原文
获取原文并翻译 | 示例

摘要

The rapid expansion of online-based services requires novel energy and performance efficient architectures to meet power and latency constraints. Fast architectural exploration has become a key enabler in the proposal of architectural innovation. In this paper, we present gem5-X, a gem5-based system level simulation framework, and a methodology to optimize many-core systems for performance and power. As real-life case studies of many-core server workloads, we use real-time video transcoding and image classification using convolutional neural networks (CNNs). Gem5-X allows us to identify bottlenecks and evaluate the potential benefits of architectural extensions such as in-cache computing and 3D stacked High Bandwidth Memory. For real-time video transcoding, we achieve 15% speed-up using in-order cores with in-cache computing when compared to a baseline in-order system and 76% energy savings when compared to an Out-of-Order system. When using HBM, we further accelerate real-time transcoding and CNNs by up to 7% and 8% respectively.
机译:基于在线的服务的快速扩展需要新颖的能源和性能高效的体系结构来满足功率和延迟限制。快速的建筑探索已成为建筑创新提案的关键推动力。在本文中,我们介绍了gem5-X,一个基于gem5的系统级仿真框架以及一种用于优化多核系统的性能和功耗的方法。作为多核服务器工作负载的实际案例研究,我们使用卷积神经网络(CNN)进行实时视频转码和图像分类。 Gem5-X使我们能够识别瓶颈并评估架构扩展的潜在优势,例如缓存内计算和3D堆栈式高带宽内存。对于实时视频转码,与基线有序系统相比,我们使用有序内核和高速缓存计算可将速度提高15%,与无序系统相比则可节省76%的能源。使用HBM时,我们分别将实时转码和CNN分别提高了7%和8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号