...
首页> 外文期刊>IEICE transactions on information and systems >Performance Optimization for Sparse A(t)Ax in Parallel on Multicore CPU
【24h】

Performance Optimization for Sparse A(t)Ax in Parallel on Multicore CPU

机译:Performance Optimization for Sparse A(t)Ax in Parallel on Multicore CPU

获取原文
获取原文并翻译 | 示例
           

摘要

The sparse matrix operation, y - y+A(t)Ax, where A is a sparse matrix and x and y are dense vectors, is a widely used computing pattern in High Performance Computing (HPC) applications. The pattern poses challenge to efficient solutions because both a matrix and its transposed version are involved. An efficient sparse matrix format, Compressed Sparse Blocks (CSB), has been proposed to provide nearly the same performance for both Ax and A(t)x. We develop a multithreaded implementation for the CSB format and apply it to solve y - y+A(t)Ax. Experiments show that our technique outperforms the Compressed Sparse Row (CSR) based solution in POSKI by up to 2.5 fold on over 70 of benchmarking matrices.

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号