首页> 外文期刊>Computers & Digital Techniques, IET >Advanced architecture optimisation and performance analysis of a reconfigurable grid ALU processor
【24h】

Advanced architecture optimisation and performance analysis of a reconfigurable grid ALU processor

机译:可重构网格ALU处理器的高级架构优化和性能分析

获取原文
获取原文并翻译 | 示例
       

摘要

In the billion transistor era only a few architectural approaches propose new paths to improve the execution of conventional sequential instruction streams. Many legacy applications could profit from processors that are able to speed-up the execution of sequential applications beyond the performance of current superscalar processors. The Grid arithmetic logic unit (ALU) Processor (GAP) accelerates conventional sequential instruction streams without the need for recompilation. The GAP comprises a processor front-end similar to that of a superscalar processor extended by a configuration unit and a twodimensional array of functional units that forms the execution unit. Instruction sequences are mapped dynamically into the array by the configuration unit so that they form the dataflow graph of the sequence. This study shows a performance evaluation of the GAP architecture with different array dimensions as well as its performance using a simplified interconnection network. GAP outperforms an out-of-order superscalar processor by a maximum of factor 2 with a complete crossbar interconnect between two array rows. Reducing the interconnection network to the minimum shows a maximum performance drawback of 10% for only a particular configuration and a single benchmark. In general, the slowdown is less than 2% for the minimum interconnect (two buses) and about 0.02% if three interconnection buses are used.
机译:在十亿个晶体管时代,只有很少的架构方法提出了新的途径来改善常规顺序指令流的执行。许多传统应用程序可以从处理器中受益,这些处理器能够加快顺序应用程序的执行速度,超过当前超标量处理器的性能。网格算术逻辑单元(ALU)处理器(GAP)无需重新编译即可加速常规顺序指令流。该GAP包括类似于由配置单元和形成执行单元的功能单元的二维阵列扩展的超标量处理器的处理器前端。指令序列由配置单元动态映射到阵列中,以便它们形成序列的数据流图。这项研究显示了具有不同阵列尺寸的GAP架构的性能评估,以及使用简化互连网络的性能评估。在两个阵列行之间具有完整的交叉开关互连的情况下,GAP的性能优于无序超标量处理器的最大因数2。仅将特定配置和单个基准降低到最低程度,即可显示出10%的最大性能缺陷。通常,最小互连(两条总线)的速度下降小于2%,如果使用三个互连总线,则速度下降约0.02%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号