Assembly Operations for Multicore Architectures using Task-Based Runtime Systems

Authors: Damien Genet, Abdou Guermouche, George Bosilca

DOI: 10.1007/978-3-319-14313-2_29

Keywords: Scheduling (computing), Parallel algorithm, Linear algebra, Computer science, Runtime system, Parallel computing, Finite element method, Directed acyclic graph, Matrix (mathematics), Multi-core processor

Abstract: Traditionally, numerical simulations based on finite element methods consider the algorithm as being divided into three major steps: the generation of a set of blocks and vectors, the assembly of these blocks into a matrix and a big vector, and the inversion of the matrix. In this paper we tackle the second step, the block assembly, where no parallel algorithm is widely available. Several strategies are proposed to decompose the assembly problem while relying on a scheduling middleware to maximize the overlap between stages and increase the parallelism and thus the performance. These strategies are quantified using examples covering two extremes in the field: a large number of non-overlapping small blocks for CFD-like problems, and a smaller number of larger blocks with significant overlap, which can be met in sparse linear algebra solvers.
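As a rough illustration of the assembly step described in the abstract, the sketch below scatter-adds small dense blocks into a global matrix and checks whether two blocks overlap. It is a minimal sequential sketch, not the paper's method: the function names `assemble` and `overlaps`, the `(indices, values)` block layout, and the sample data are all assumptions made for this example.

```python
# Hypothetical illustration only: the names and the block layout are
# assumptions for this sketch, not the API or algorithm of the paper.

def assemble(n, blocks):
    """Scatter-add dense blocks into an n x n global matrix.

    Each block is a pair (indices, values): `indices` lists the global
    rows/columns the block touches, and `values` is the matching square
    dense block stored as a list of rows.
    """
    A = [[0.0] * n for _ in range(n)]
    for indices, values in blocks:
        for local_i, global_i in enumerate(indices):
            for local_j, global_j in enumerate(indices):
                # Overlapping blocks contribute to the same entry, so add.
                A[global_i][global_j] += values[local_i][local_j]
    return A


def overlaps(block_a, block_b):
    """Two blocks conflict when they share at least one global index."""
    return bool(set(block_a[0]) & set(block_b[0]))


# Sample data (made up): the first two blocks share global index 1,
# the third block touches indices no other block uses.
blocks = [
    ([0, 1], [[1.0, 2.0], [3.0, 4.0]]),
    ([1, 2], [[10.0, 20.0], [30.0, 40.0]]),
    ([3, 4], [[5.0, 6.0], [7.0, 8.0]]),
]
A = assemble(5, blocks)
```

Blocks for which `overlaps` returns `False` write to disjoint entries and could be assembled concurrently without synchronization, whereas overlapping blocks accumulate into shared entries and need ordering or a reduction. These correspond to the two extremes the abstract contrasts.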

References (18)
George Bosilca. Dense Linear Algebra on Distributed Heterogeneous Hardware with a Symbolic DAG Approach. Scalable Computing and Communications: Theory and Practice (to appear), (2012).
Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Florent Lopez. Multifrontal QR Factorization for Multicore Architectures over Runtime Systems. Euro-Par 2013 Parallel Processing, vol. 8097, pp. 521-532, (2013). DOI: 10.1007/978-3-642-40047-6_53
Enrique S. Quintana-Ortí, Gregorio Quintana-Ortí, Jesús Labarta, José R. Herrero, Josep M. Pérez, Rosa M. Badia. Parallelizing dense and banded linear algebra libraries using SMPSs. Concurrency and Computation: Practice and Experience, vol. 21, pp. 2438-2456, (2009). DOI: 10.1002/CPE.V21:18
G. R. Markall, A. Slemmer, D. A. Ham, P. H. J. Kelly, C. D. Cantwell, S. J. Sherwin. Finite element assembly strategies on multi-core and many-core architectures. International Journal for Numerical Methods in Fluids, vol. 71, pp. 80-97, (2013). DOI: 10.1002/FLD.3648
Patrick R. Amestoy, Abdou Guermouche, Jean-Yves L’Excellent, Stéphane Pralet. Hybrid scheduling for the parallel solution of linear systems. Parallel Computing, vol. 32, pp. 136-156, (2006). DOI: 10.1016/J.PARCO.2005.07.004
Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Robert A. Van De Geijn, Field G. Van Zee, Ernie Chan. Programming matrix algorithms-by-blocks for thread-level parallelism. ACM Transactions on Mathematical Software, vol. 36, pp. 1-26, (2009). DOI: 10.1145/1527286.1527288
Peter Huthwaite. Accelerated finite element elastodynamic simulations using the GPU. Journal of Computational Physics, vol. 257, pp. 687-707, (2014). DOI: 10.1016/J.JCP.2013.10.017
Emmanuel Agullo, Jim Demmel, Jack Dongarra, Bilel Hadri, Jakub Kurzak, Julien Langou, Hatem Ltaief, Piotr Luszczek, Stanimire Tomov. Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects. Journal of Physics: Conference Series, vol. 180, pp. 012037-, (2009). DOI: 10.1088/1742-6596/180/1/012037
Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Eric Darve, Matthias Messner, Toru Takahashi. Task-Based FMM for Multicore Architectures. SIAM Journal on Scientific Computing, vol. 36, pp. 33-, (2014). DOI: 10.1137/130915662
Nina Hanzlikova, Eduardo Rocha Rodrigues. A novel finite element method assembler for co-processors and accelerators. Irregular Applications: Architectures and Algorithms, pp. 1-8, (2013). DOI: 10.1145/2535753.2535754