首页> 外文会议> >Unified system level reliability evaluation methodology for multiprocessor Systems-on-Chip
【24h】

Unified system level reliability evaluation methodology for multiprocessor Systems-on-Chip

机译:多处理器片上系统的统一系统级可靠性评估方法

获取原文
获取原文并翻译 | 示例

摘要

Reliability is a growing fundamental challenge in the design of multiprocessor Systems-on-Chip (MPSoCs). This trend is accelerated by the increasingly adverse process variations and wearout mechanisms that result in an increased number of errors. Previously proposed fault-tolerant techniques are ad-hoc and target processors or Networks-on-Chip (NoC) separately. Because each of these two units may become a reliability bottleneck for NoC based multiprocessor SoCs, it is imperative that the reliability of SoCs be evaluated and addressed in a unified manner, as a combination of communication and computational units. Using this holistic approach, in this paper, we propose a new architecture level unified reliability evaluation methodology for MPSoCs. At the core of the reliability estimation engine lies a Monte Carlo algorithm which works with failure times for time-dependent dielectric breakdown (TDDB) and negative bias temperature instability (NBTI) modeled as Weibull distributions. To demonstrate its usefulness, we utilize the proposed methodology to explore the impact of NoC router layout on the failure time of the system running the same set of benchmarks. In addition, we investigate the failure time of the system when the NoC as the communication unit of the MPSoC is taken or not - as in previous work - into consideration. Our simulation framework can be very helpful to architecture designers, who could use it to identify architectural characteristics and to develop design techniques meant to improve system's lifetime.
机译:在多处理器片上系统(MPSoC)的设计中,可靠性是一个日益严峻的基本挑战。由于趋势越来越不利的工艺变化和磨损机制(导致错误数量增加)而加速了这种趋势。先前提出的容错技术分别是ad-hoc和目标处理器或片上网络(NoC)。因为这两个单元中的每一个都可能成为基于NoC的多处理器SoC的可靠性瓶颈,所以必须以通信和计算单元的组合,以统一的方式评估和解决SoC的可靠性。本文采用这种整体方法,为MPSoC提出了一种新的体系结构级统一可靠性评估方法。可靠性估计引擎的核心是蒙特卡洛算法,该算法可将故障时间与时间相关的电介质击穿(TDDB)和负偏置温度不稳定性(NBTI)建模为Weibull分布。为了证明其有用性,我们利用提出的方法来研究NoC路由器布局对运行相同基准测试系统的故障时间的影响。此外,我们将调查是否考虑了将NoC作为MPSoC的通信单元时的系统故障时间(如先前的工作一样)。我们的仿真框架对体系结构设计人员非常有用,他们可以使用它来识别体系结构特征并开发旨在提高系统寿命的设计技术。

著录项

  • 来源
    《》|2012年|p.1- 6|共6页
  • 会议地点 San Jose CA(US)
  • 作者单位

    Department of Electrical and Computer Engineering, North Dakota State University, Fargo, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 能源;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号