Authors: J.C. Pichel, D.B. Heras, J.C. Cabaleiro, F.F. Rivera
DOI: 10.1109/EMPDP.2004.1271429
Keywords: Matrix-free methods, Matrix multiplication, Shared memory, Row and column spaces, Sparse matrix, Sparse approximation, Computer science, Parallel computing, Locality, Set (abstract data type)
Abstract: We extend a model of locality, and the subsequent process of locality improvement, previously developed for the case of sparse algebra codes in monoprocessors to the case of NUMA shared memory multiprocessors (SMPs). In particular, the product of a sparse matrix by a dense vector (SpM×V) is studied. In the model, locality is established at run-time considering parameters that describe the structure of the sparse matrix involved in the computations. The problem of increasing the locality is formulated as a graph problem, whose solution indicates some appropriate reordering of the rows and columns of the sparse matrix. The reordering algorithms were tested on a broad set of matrices. We have also performed a comparison with other reordering algorithms. The results lead to general conclusions about improving the SMP performance of other sparse algebra codes.
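
As a rough illustration of the operation the abstract studies, the following Python sketch computes SpM×V on a matrix stored in CSR form and applies a row and column reordering to it. The CSR layout, the function names `spmv_csr` and `permute_csr`, and the permutation used in the example are assumptions made for illustration only. The abstract does not specify the storage format, the locality model, or the graph-based algorithm that selects the reordering, and none of those are reproduced here. The sketch only shows that a reordering changes the memory access pattern while leaving the product unchanged up to the same permutation.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A x for a sparse matrix A stored in CSR form."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        # Accesses to x follow the column indices of row i; how scattered
        # those indices are determines the locality of the reads from x.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y


def permute_csr(row_ptr, col_idx, values, row_perm, col_perm):
    """Return the CSR form of B, where B[i][j] = A[row_perm[i]][col_perm[j]]."""
    # Map each old column index to its new position.
    new_col_of = [0] * len(col_perm)
    for new_j, old_j in enumerate(col_perm):
        new_col_of[old_j] = new_j

    new_ptr, new_idx, new_val = [0], [], []
    for old_i in row_perm:
        # Relabel the columns of the old row and keep them sorted.
        entries = sorted(
            (new_col_of[col_idx[k]], values[k])
            for k in range(row_ptr[old_i], row_ptr[old_i + 1])
        )
        for j, v in entries:
            new_idx.append(j)
            new_val.append(v)
        new_ptr.append(len(new_idx))
    return new_ptr, new_idx, new_val


# Hypothetical 3x3 example: A = [[1, 0, 2], [0, 3, 0], [4, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [1.0, 2.0, 3.0, 4.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)

# An arbitrary permutation, chosen only for illustration.
row_perm = [2, 0, 1]
col_perm = [2, 0, 1]
b_ptr, b_idx, b_val = permute_csr(row_ptr, col_idx, values, row_perm, col_perm)
x_perm = [x[j] for j in col_perm]
y_perm = spmv_csr(b_ptr, b_idx, b_val, x_perm)
```

In this sketch `y_perm[i]` equals `y[row_perm[i]]`, so the original result is recovered by undoing the row permutation. A locality-driven method of the kind the abstract describes would choose `row_perm` and `col_perm` from the sparsity pattern rather than fixing them by hand.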