作者: Muthu M. Baskaran , Rajesh J. Bordawekar
DOI:
关键词:
摘要: Techniques for optimizing sparse matrix-vector multiplication (SpMV) on a graphics processing unit (GPU) are provided. The techniques include receiving multiplication, analyzing the to identify one or more optimizations, wherein optimizations comprises non-zero pattern and determining whether is be reused across computation, global memory access, shared access exploiting reuse parallelism, outputting an optimized multiplication.