首页> 外文会议>International Conference on VLSI Design >Evaluation of speed and area of clustered VLIW processors
【24h】

Evaluation of speed and area of clustered VLIW processors

机译:聚类VLIW处理器速度和面积评估

获取原文

摘要

Splitting a wide-issue VLIW processor in clusters decreases the clock period, area and power consumption. Previous studies of the physical benefits from clustering focused on the scalability of the register file, based on speculative analytical models. In contrast, we evaluate speed and area of the whole VLIW datapath, including the register files, FUs, and bypasses through realistic physical layout experiments. Our baseline is an optimized for speed 8-issue-slot VLIW pipeline, derived from a commercial media processor. Despite the frequent prior-art assumption that the register file defines the clock frequency of a clustered VLIW processor, we discovered it is the FU bypass network that limits the clock speed. After a drastic 1.75 clock frequency speedup from clustering the unicluster into two clusters, subsequent clustering brings modest 2.05 and 2.17 speedups for the 4- and 8-cluster VLIWs, respectively. Combined with cycle count increase trends due to clustering, this leads to the conclusion that excessive clustering does not speed up the processor. Furthermore, clustering reduces area of the VLIW datapath. In our experiments the area savings due to clustering reach 14.2%, 43.5% and 50.5% for the 2-, 4-, and 8-cluster VLIWs, respectively.
机译:分割成簇宽问题VLIW处理器减小所述时钟周期,面积和功率消耗。从聚类身体的好处以往的研究主要集中在寄存器文件的可扩展性,以投机的分析模型。相比之下,我们评估的速度和整个VLIW数据路径的区域,包括寄存器文件,滤波单元,并通过逼真的物理布局实验旁路。我们的基线的速度8问题槽VLIW管线优化的,从商用媒体处理器的。尽管频繁现有技术假设寄存器文件定义群集的VLIW处理器的时钟频率,我们发现它是FU支路网络限制时钟速度。从聚类unicluster成两个集群的急剧1.75时钟频率的加速之后,后续的聚类分别带来适度2.05和2.17的加速为4和8簇的VLIW。与周期数增加趋势,因聚集,这导致的结论是,过度的群集不加速处理器相结合。此外,集群降低了VLIW数据路径的区域。在我们的实验中面积节省由于分别聚类达到14.2%,43.5%和50.5%的2-,4-,和8-簇的VLIW。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号