Sparse Matrix-Vector Multiplication on GPGPUs

作者: Salvatore Filippone , Valeria Cardellini , Davide Barbieri , Alessandro Fanfarillo

DOI: 10.1145/3017994

关键词:

摘要: The multiplication of a sparse matrix by dense vector (SpMV) is centerpiece scientific computing applications: it the essential kernel for solution linear systems and eigenvalue problems iterative methods. efficient implementation matrix-vector therefore crucial has been subject an immense amount research, with interest renewed every major new trend in high-performance architectures. introduction General-Purpose Graphics Processing Units (GPGPUs) no exception, many articles have devoted to this problem.With article, we provide review techniques implementing SpMV on GPGPUs that appeared literature last few years. We discuss issues tradeoffs encountered various researchers, list solutions, organized categories according common features. also performance comparison across different GPGPU models set test matrices coming from application domains.

参考文章(115)
Yuji Kubota, Daisuke Takahashi, Optimization of Sparse Matrix-Vector Multiplication by Auto Selecting Storage Schemes on GPU Computational Science and Its Applications - ICCSA 2011. pp. 547- 561 ,(2011) , 10.1007/978-3-642-21887-3_42
Daichi Mukunoki, Daisuke Takahashi, Optimization of Sparse Matrix-Vector Multiplication for CRS Format on NVIDIA Kepler Architecture GPUs Lecture Notes in Computer Science. pp. 211- 223 ,(2013) , 10.1007/978-3-642-39640-3_15
J. Dongarra, J. Demmel, A. Petitet, I. Dhillon, D. Walker, R. C. Whaley, S. Ostrouchov, J. Choi, K. Staney, LAPACK Working Note 95: ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers -- Design Issues and Performance University of Tennessee. ,(1995)
Alessandro Fanfarillo, Davide Barbieri, Valeria Cardellini, Salvatore Filippone, Three storage formats for sparse matrices on GPGPUs hgpu.org. ,(2015)
Yongchao Liu, Bertil Schmidt, LightSpMV: Faster CSR-based sparse matrix-vector multiplication on CUDA-enabled GPUs application-specific systems, architectures, and processors. pp. 82- 89 ,(2015) , 10.1109/ASAP.2015.7245713
Ali Cevahir, Akira Nukada, Satoshi Matsuoka, Fast Conjugate Gradients with Multiple GPUs international conference on computational science. pp. 893- 903 ,(2009) , 10.1007/978-3-642-01970-8_90