Streaming sparse matrix compression/decompression

Authors: David Moloney, Dermot Geraghty, Colm McSweeney, Ciaran McElroy

DOI: 10.1007/11587514_9

Abstract: A streaming floating-point sparse-matrix compression scheme, which forms a key element of an accelerator for finite-element and other linear algebra applications, is described. The proposed architecture seeks to accelerate the performance-limiting Sparse Matrix-Vector Multiplication (SMVM) operation at the heart of these applications through a combination of a dedicated datapath optimized for these applications with a data-compression and decompression unit that increases the effective memory bandwidth seen by the datapath. The compression format uses variable-length entries that contain an opcode and, optionally, an address and/or a non-zero entry. System simulations performed using a cycle-accurate C++ architectural model and a database of over 400 large symmetric and unsymmetric matrices containing up to 20M elements (and a total of 226M non-zeroes) demonstrate that a 20% average performance improvement can be achieved compared with published work, for a modest increase in hardware resources.
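
The variable-length entry format mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual encoding, so everything specific here is an assumption made for illustration: the opcode bits `HAS_ADDR` and `HAS_VALUE`, the 32-bit address and 64-bit value widths, the use of a row-major linear index as the address, and the rules for when a field is omitted (address omitted when it follows the previous one, value omitted when it repeats the previous one). It is not the authors' format.

```python
import struct

# Hypothetical opcode bits; the real opcode set is not given in the abstract.
HAS_ADDR = 0x1   # entry carries an explicit 32-bit linear index
HAS_VALUE = 0x2  # entry carries an explicit 64-bit float


def compress(entries):
    """Encode (linear_index, value) pairs, sorted by index, as a byte stream."""
    out = bytearray()
    prev_idx, prev_val = -1, None
    for idx, val in entries:
        op = 0
        if idx != prev_idx + 1:   # not adjacent to the previous entry
            op |= HAS_ADDR
        if val != prev_val:       # value differs from the previous entry
            op |= HAS_VALUE
        out.append(op)
        if op & HAS_ADDR:
            out += struct.pack("<I", idx)
        if op & HAS_VALUE:
            out += struct.pack("<d", val)
        prev_idx, prev_val = idx, val
    return bytes(out)


def decompress(stream):
    """Yield (linear_index, value) pairs from a stream made by compress()."""
    pos, idx, val = 0, -1, None
    while pos < len(stream):
        op = stream[pos]
        pos += 1
        if op & HAS_ADDR:
            (idx,) = struct.unpack_from("<I", stream, pos)
            pos += 4
        else:
            idx += 1              # implicit address: next linear index
        if op & HAS_VALUE:
            (val,) = struct.unpack_from("<d", stream, pos)
            pos += 8
        yield idx, val


def smvm(stream, x, n_rows):
    """Compute y = A @ x while decoding the compressed stream of A."""
    n_cols = len(x)
    y = [0.0] * n_rows
    for idx, val in decompress(stream):
        row, col = divmod(idx, n_cols)
        y[row] += val * x[col]
    return y


# 3x3 example matrix [[2, 2, 0], [0, 0, 0], [0, 5, 5]] as linear-index entries.
entries = [(0, 2.0), (1, 2.0), (7, 5.0), (8, 5.0)]
stream = compress(entries)
y = smvm(stream, [1.0, 2.0, 3.0], n_rows=3)
```

In this toy case the stream occupies 24 bytes, against 48 bytes for four uncompressed (32-bit index, 64-bit value) pairs. The saving comes entirely from the omitted fields, which is the sense in which such a format can raise the effective memory bandwidth seen by a datapath; the figures reported in the abstract come from the authors' own scheme and matrix database, not from this sketch.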
