Register-based Implementation of the Sparse General Matrix-Matrix Multiplication on GPUs

Junhong Liu; Xin He; Weifeng Liu; Guangming Tan

首页> 外文期刊>ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages >Register-based Implementation of the Sparse General Matrix-Matrix Multiplication on GPUs

【24h】

Register-based Implementation of the Sparse General Matrix-Matrix Multiplication on GPUs

机译：基于寄存器的GPU稀疏常规矩阵矩阵乘法的实现

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

General sparse matrix-matrix multiplication (SpGEMM) is an essential building block in a number of applications. In our work, we fully utilize GPU registers and shared memory to implement an efficient and load balanced SpGEMM in comparison with the existing implementations.

机译：一般稀疏矩阵矩阵乘法（SPGEMM）是许多应用中的基本构建块。在我们的工作中，我们充分利用GPU寄存器和共享内存，与现有实现相比，实现有效和负载平衡的SPGEMM。

著录项

来源
《ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages》 |2018年第1期|共2页
作者
Junhong Liu; Xin He; Weifeng Liu; Guangming Tan;
展开▼
作者单位

?State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences;

?State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences;

Department of Computer Science Norwegian University of Science and Technology;

?State Key Laboratory of Computer Architecture Institute of Computing Technology Chinese Academy of Sciences;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词
multiplication; efficient; comparison;

机译：乘法;高效;比较;

相似文献

外文文献
中文文献
专利

1. Register-based Implementation of the Sparse General Matrix-Matrix Multiplication on GPUs [J] . Junhong Liu, Xin He, Weifeng Liu, ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages . 2018,第1期

机译：基于寄存器的GPU稀疏常规矩阵矩阵乘法的实现
2. Multithreaded sparse matrix-matrix multiplication for many-core and GPU architectures [J] . Deveci Mehmet, Trott Christian, Rajamanickam Sivasankaran Parallel Computing . 2018,第octa期

机译：适用于多核和GPU架构的多线程稀疏矩阵矩阵乘法
3. Optimizing Sparse Matrix-Matrix Multiplication for the GPU [J] . Dalton Steven, Olson Luke, Bell Nathan ACM transactions on mathematical software . 2015,第4期

机译：为GPU优化稀疏矩阵-矩阵乘法
4. Efficient Sparse-Dense Matrix-Matrix Multiplication on GPUs Using the Customized Sparse Storage Format [C] . Shaohuai Shi, Qiang Wang, Xiaowen Chu IEEE International Conference on Parallel and Distributed Systems . 2020

机译：使用定制稀疏存储格式的高效稀疏密集矩阵矩阵乘法
5. Optimizing Tall-and-skinny Matrix-matrix Multiplication on GPUs [D] . Xiong, Nan 2018

机译：在GPU上优化高而瘦的矩阵矩阵乘法
6. Molecular Dynamics Simulations Using the Drude Polarizable Force Field on GPUs with OpenMM: Implementation Validation and Benchmarks [O] . Jing Huang, Justin A. Lemkul, Peter K. Eastman, -1

机译：在带有OpenMM的GPU上使用Drude可极化力场的分子动力学模拟：实现验证和基准
7. Efficient Sparse-Dense Matrix-Matrix Multiplication on GPUs Using the Customized Sparse Storage Format [O] . Shaohuai Shi, Qiang Wang, Xiaowen Chu 2020

机译：使用定制稀疏存储格式的高效稀疏密集矩阵矩阵乘法

Register-based Implementation of the Sparse General Matrix-Matrix Multiplication on GPUs

摘要

著录项

相似文献

相关主题

期刊订阅