Conference: International Symposium on Parallel and Distributed Computing

Parabilis: Speeding up Single-Threaded Applications by Extracting Fine-Grained Threads for Multi-core Execution


Abstract

The trend in architectural designs has been towards using simple cores for building multicore chips, instead of a single complex out-of-order (OOO) core, due to the increased complexity and energy requirements of out-of-order processors. Multicore chips provide better performance than OOO cores when executing parallel applications. However, they are not able to exploit the parallelism inherent in single-threaded applications. To this end, this paper presents a compiler optimization methodology coupled with minimal hardware extensions to extract simple fine-grained threads from a single-threaded application, for execution on multiple cores of a chip multiprocessor (CMP). These fine-grained threads are independent and eliminate the need for communication between cores, reducing costly communication latencies. This approach, which we call Parabilis, is scalable for up to eight cores and does not require complex hardware additions to simple multicore systems. Our evaluation shows that Parabilis yields an average speedup of 1.51 on an 8-core CMP architecture.
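To put the reported figure in context, the short sketch below works out what an average speedup of 1.51 on 8 cores means per core. The efficiency calculation follows directly from the two numbers in the abstract. The second function applies Amdahl's law, which the abstract does not mention, so its result is only an illustrative assumption and not a claim made by the authors. The function names are hypothetical.

```python
def parallel_efficiency(speedup: float, cores: int) -> float:
    """Fraction of ideal linear speedup achieved: speedup / cores."""
    return speedup / cores


def amdahl_parallel_fraction(speedup: float, cores: int) -> float:
    """Parallel fraction p implied by Amdahl's law, 1/S = (1 - p) + p/N.

    Assumption: Amdahl's law is used here only as a rough model;
    the source abstract does not state that it applies.
    """
    return (1.0 - 1.0 / speedup) / (1.0 - 1.0 / cores)


# Figures taken from the abstract: average speedup 1.51 on an 8-core CMP.
eff = parallel_efficiency(1.51, 8)
frac = amdahl_parallel_fraction(1.51, 8)

print(f"parallel efficiency: {eff:.3f}")
print(f"implied parallel fraction (Amdahl, assumed): {frac:.3f}")
```

Under these assumptions, the efficiency is about 19% of ideal linear scaling, and the Amdahl model would attribute roughly 39% of execution time to code that was parallelized.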