首页> 外文会议>Workshop on Cloud Computing Projects and Initiatives >Column-Based Matrix Partitioning for Parallel Matrix Multiplication on Heterogeneous Processors Based on Functional Performance Models
【24h】

Column-Based Matrix Partitioning for Parallel Matrix Multiplication on Heterogeneous Processors Based on Functional Performance Models

机译:基于列的基于矩阵乘法的基于矩阵乘法基于功能性能模型的异构处理器

获取原文

摘要

In this paper we present a new data partitioning algorithm to improve the performance of parallel matrix multiplication of dense square matrices on heterogeneous clusters. Existing algorithms either use single speed performance models which are too simplistic or they do not attempt to minimised the total volume of communication. The functional performance model (FPM) is more realistic then single speed models because it integrates many important features of heterogeneous processors such as the processor heterogeneity, the heterogeneity of memory structure, and the effects of paging. To load balance the computations the new algorithm uses FPMs to compute the area of the rectangle that is assigned to each processor. The total volume of communication is then minimised by choosing a shape and ordering so that the sum of the half-perimeters is minimised. Experimental results demonstrate that this new algorithm can reduce the total execution time of parallel matrix multiplication in comparison to existing algorithms.
机译:本文介绍了一种新的数据分区算法,提高了密集方矩阵对异构簇的平行矩阵乘法的性能。现有的算法使用单速性能模型,这些模型太简单,或者它们不会尝试最小化通信总量。功能性能模型(FPM)是更逼真的,然后是单速模型,因为它集成了异质处理器的许多重要特征,例如处理器异质性,存储器结构的异构性和寻呼的效果。要加载余额,该计算新算法使用FPM来计算分配给每个处理器的矩形区域。然后通过选择形状和排序来最小化通信总量,使得半周周的总和最小化。实验结果表明,与现有算法相比,该新算法可以减少并行矩阵乘法的总执行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号