首页> 外文会议>International Conference on VLSI Design >Synthesis of application-specific heterogeneous multiprocessor architectures using extensible processors
【24h】

Synthesis of application-specific heterogeneous multiprocessor architectures using extensible processors

机译:使用可扩展处理器综合应用程序特定的异构多处理器架构

获取原文

摘要

Nanometer fabrication technologies have made it feasible to integrate multiple processors on a single chip. Heterogeneous multiprocessor systems-on-chip (MPSoCs), in which different processors are customized for specific tasks, can provide high levels of efficiency in performance and power consumption, while maintaining programmability. However, in order to best exploit processor heterogeneity, designers are still required to manually customize each processor, while mapping the application tasks to them, so that the overall performance and/or power requirements are satisfied. In this paper, we propose a methodology to automatically synthesize a custom (heterogeneous) architecture, consisting of multiple extensible processors, to best speed up a given application. Our methodology simultaneously customizes the instruction set of, and assigns application tasks to, each processor in the multiprocessor system, while scheduling their execution. We motivate the need for such an integrated approach by demonstrating that custom instruction selection has complex interdependencies with task assignment and scheduling, and performing these steps independently often results in significant degradation in the quality of the synthesized multiprocessor architecture. Our methodology uses an iterative improvement algorithm to assign and schedule tasks on processors and select custom instructions along the critical path in an interleaved manner. It utilizes the concept of "expected execution time" to better integrate these two steps. It not only considers the currently selected custom instructions for the current task assignment and schedule, but also the possibility of better custom instructions being selected in future iterations. We also enhance our methodology to integrate task-level software pipelining to further increase the parallelism and provide opportunities for multiprocessing. We have implemented the proposed heterogeneous multiprocessor synthesis methodology in the context of a commercial extensible processor design flow using the Xtensa/spl trade/ platform from Tensilica Inc. We have evaluated our tool by automatically generating custom multiprocessor architectures for several complex embedded software benchmarks. The results show that architectures synthesized by the proposed methodology demonstrate an average speedup of 2.0 /spl times/ (up to 3.1 /spl times/) compared to symmetric multiprocessor architectures in which the processors have not been augmented with custom instructions. To the best of our knowledge, this is the first tool for the synthesis of custom MPSoCs using extensible processors.
机译:纳米制造技术使得将多个处理器集成在单个芯片上的可行性。异构多处理器系统的片上系统(MPSOC),其中不同的处理器用于特定任务,可以在性能和功耗中提供高水平的效率,同时保持可编程性。但是,为了最佳利用处理器异质性,设计人员仍然需要手动自定义每个处理器,同时将应用程序任务映射到它们,从而满足整体性能和/或电源要求。在本文中,我们提出了一种方法来自动综合由多个可扩展处理器组成的自定义(异构)架构,以最佳加快给定的应用程序。我们的方法同时自定义了指令集,并将应用程序任务分配给多处理器系统中的每个处理器,同时调度它们的执行。我们通过展示自定义指令选择具有复杂的相互依赖性,与任务分配和调度具有复杂的相互依赖性,并且独立地执行这些步骤的需求,并且常常导致合成的多处理器架构的质量显着降低。我们的方法使用迭代改进算法在处理器上分配和调度任务,并以交错方式选择沿着关键路径的自定义指令。它利用“预期执行时间”的概念来更好地整合这两个步骤。它不仅考虑当前任务分配和计划的当前所选的自定义说明,还考虑了在将来迭代中选择更好的自定义指令的可能性。我们还增强了我们的方法,以集成任务级软件流水线,以进一步增加并行性并为多处理提供机会。我们在使用Tensilica Inc的XTensa / SPL贸易/平台的商业可扩展处理器设计流程中实现了所提出的异构多处理器合成方法。我们通过自动为几个复杂的嵌入式软件基准测试自动生成自定义多处理器架构来评估我们的工具。结果表明,由所提出的方法合成的架构展示了与对称多处理器架构相比的2.0 / SPL时间/(最多3.1 / SPL时间/)的平均速度,其中处理器尚未使用自定义说明进行增强。据我们所知,这是使用可扩展处理器合成自定义MPSOC的第一个工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号