首页> 外文学位 >Serial Code Accelerators for Heterogeneous Multi-core Processor with Three-Dimensional Stacked Memory.
【24h】

Serial Code Accelerators for Heterogeneous Multi-core Processor with Three-Dimensional Stacked Memory.

机译:具有三维堆栈存储器的异构多核处理器的串行代码加速器。

获取原文
获取原文并翻译 | 示例

摘要

Asymmetric multi-core would be the pathway for future microprocessors with different cores performing different functionality. Of particular interest is the handling of serial and parallel code present in a given application. Performance improvement in multi-core microprocessors is limited by Amdahl's Law which states that speed up achieved by adding more cores gets saturated because of the presence of serial code in all applications. This calls for heterogeneous core integration of a fast core that accelerates serial code along with other CMOS cores that execute parallel component of the application. Performance improvement of parallel code can be improved by adding more cores, while higher clock rates can benefit serial. In either case, there is an associated memory wall problem due to limited bandwidth. We therefore require 3D stacked memory to overcome this problem. In this thesis, I evaluate high clock rate processors as well as shared memory processors with large number of cores with 3D stacked memory. Since clock rates for CMOS have tended to saturate due to wire scaling problems and excessive heat dissipation, one must look to an alternate three terminal device, which is compatible with CMOS i.e. BJTs. SiGe Hetero-junction Bipolar Transistor (HBT) BiCMOS process is used to build such fast digital chips that can clock in the 20-30GHz at reasonable power levels and densities. 3D memory stacked on top of a processor core can provide several advantages from wide bandwidth to multiport caches serving multiple cores. Memory Processor chip stacking reduces this Memory Wall problem by using a large number of vertical vias between tiers in the stack, for ultra wide bit path transfer of data and address information to and from various levels of cache. Chips have been successfully fabricated and tested in the 3D MITLL process as well as SiGe BiCMOS process. Thermal modeling for possible integration of these two processes has been carried out. The research is progressing towards the heterogeneous core integration with 3D memory.
机译:不对称的多核将成为未来具有不同核执行不同功能的微处理器的途径。特别令人感兴趣的是处理给定应用程序中存在的串行和并行代码。阿姆达尔定律(Amdahl's Law)限制了多核微处理器的性能提高,该定律指出,由于所有应用中都存在串行代码,因此增加更多内核所达到的速度将达到饱和。这就需要快速内核的异构内核集成,以加速串行代码以及执行应用程序并行组件的其他CMOS内核。可以通过添加更多内核来提高并行代码的性能,而更高的时钟速率可以使串行受益。在这两种情况下,由于带宽有限,都存在相关的内存墙问题。因此,我们需要3D堆栈存储器来解决此问题。在本文中,我评估了高时钟速率处理器以及具有3D堆栈存储器的具有大量内核的共享存储器处理器。由于CMOS的时钟速率由于线径问题和过多的散热而趋于饱和,因此必须寻找与CMOS即BJT兼容的另一三端设备。 SiGe异质结双极晶体管(HBT)BiCMOS工艺用于构建可以在合理的功率水平和密度下以20-30GHz时钟运行的快速数字芯片。堆叠在处理器内核顶部的3D内存可提供多种优势,从宽带宽到为多个内核服务的多端口缓存。内存处理器芯片堆栈通过在堆栈的各层之间使用大量垂直过孔来减少与内存层之间的数据和地址信息的超宽位路径传输,从而减少了内存墙问题。芯片已经在3D MITLL工艺以及SiGe BiCMOS工艺中成功制造和测试。已经对这两个过程的可能集成进行了热建模。研究正在朝着与3D内存进行异构内核集成的方向发展。

著录项

  • 作者

    Jacob, Philip.;

  • 作者单位

    Rensselaer Polytechnic Institute.;

  • 授予单位 Rensselaer Polytechnic Institute.;
  • 学科 Engineering Computer.;Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 209 p.
  • 总页数 209
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号