首页> 外文期刊>Journal of supercomputing >Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors
【24h】

Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors

机译:高度瘦的QR QR分解,具有在图形处理器上的近似家庭式反射器

获取原文
获取原文并翻译 | 示例
           

摘要

We present a novel method for the QR factorization of large tall-and-skinny matrices that introduces an approximation technique for computing the Householder vectors.This approach is very competitive on a hybrid platform equipped with a graphics processor, with a performance advantage over the conventional factorization due to the reduced amount of data transfers between the graphics accelerator and the main memory of the host. Our experiments show that, for tall-skinny matrices, the new approach outperforms the code in MAGMA by a large margin, while it is very competitive for square matrices when the memory transfers and CPU computations are the bottleneck of the Householder QR factorization.
机译:我们提出了一种关于大型高瘦矩阵的QR分解的新方法,它引入了计算住户向量的近似技术。这种方法在配备图形处理器的混合平台上具有非常竞争力的方法,具有通过传统的性能优势由于图形加速器之间的数据传输量减少和主机的主存储器而导致的分解。我们的实验表明,对于高瘦矩阵,新方法通过大幅度优于岩浆中的代码,而当存储器转移和CPU计算是家庭接管QR分解的瓶颈时,方形矩阵非常竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号