首页> 外文学位 >Exploring efficient architecture design for thread-level speculation---Power and performance perspectives.
【24h】

Exploring efficient architecture design for thread-level speculation---Power and performance perspectives.

机译:探索用于线程级推测的有效体系结构设计-功率和性能方面的观点。

获取原文
获取原文并翻译 | 示例

摘要

With the advent of multi-threaded (e.g. simultaneous multi-threading (SMT) [1, 2]) and/or multi-core (e.g. chip multiprocessors (CMP) [3, 4]) architectures, now the challenge is to utilize these architectures to improve performance of general-purpose applications. However, traditional parallelizing compilers often fail to effectively parallelize general-purpose applications which typically have complex control flow and excessive pointer usage. Thread-Level Speculation (TLS) have been proposed to simplify the task of parallelization by using speculative threads. Though the performance of TLS has been studied in the past, its power consumption, power efficiency and thermal behavior are not well understood. Also previous work on TLS have concentrated on multi-core based architectures and relatively little has been done on supporting TLS on multi-threaded architectures. With increasing multi-threaded/multi-core design choices, it is important to understand the benefits of the different types of architectures.;The goal of this dissertation is to explore architecture techniques to efficiently implement TLS in future multi-threaded/multi-core processors. The dissertation proposes a novel cache-based architecture to support TLS in multi-threaded SMT architecture. A detailed study on the efficiency of different TLS architectures was conducted by comparing their performance, power and thermal characteristics. To improve efficiency, the dissertation proposes a novel SMT-CMP based heterogeneous architecture which combines the advantages of both SMT and CMP architectures. The dissertation also proposes novel architecture and compiler techniques to efficiently extract speculative parallelism from multiple loop levels.
机译:随着多线程(例如,同时多线程(SMT)[1,2])和/或多核(例如,芯片多处理器(CMP)[3,4])体系结构的出现,现在的挑战是利用这些用于提高通用应用程序性能的体系结构。但是,传统的并行化编译器通常无法有效地并行化通常具有复杂控制流和过多指针使用情况的通用应用程序。已提出使用线程级推测(TLS)来简化使用推测线程的并行化任务。尽管过去已经对TLS的性能进行了研究,但对TLS的功耗,功率效率和热行为却知之甚少。同样,以前关于TLS的工作主要集中在基于多核的体系结构上,而在支持多线程体系结构上的TLS方面所做的工作相对较少。随着多线程/多核设计选择的增加,理解不同类型的体系结构的好处非常重要。本论文的目的是探索可在未来的多线程/多核中有效实现TLS的体系结构技术。处理器。本文提出了一种新颖的基于缓存的架构,以支持多线程SMT架构中的TLS。通过比较它们的性能,功率和散热特性,对不同TLS架构的效率进行了详细研究。为了提高效率,本文提出了一种新颖的基于SMT-CMP的异构架构,该架构结合了SMT和CMP架构的优点。论文还提出了新颖的体系结构和编译器技术,可以有效地从多个循环级别提取推测性并行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号