Assembly Operations for Multicore Architectures using Task-Based Runtime Systems

作者： Damien Genet , Abdou Guermouche , George Bosilca

关键词: Scheduling (computing) 、 Parallel algorithm 、 Linear algebra 、 Computer science 、 Runtime system 、 Parallel computing 、 Finite element method 、 Directed acyclic graph 、 Matrix (mathematics) 、 Multi-core processor

摘要: Traditionally, numerical simulations based on finite element methods consider the algorithm as being divided in three major steps: generation of a set blocks and vectors, assembly these matrix big vector, inversion matrix. In this paper we tackle second step, block assembly, where no parallel is widely available. Several strategies are proposed to decompose problem while relying scheduling middle-ware maximize overlap between stages increase parallelism thus performance. These quantified using examples covering two extremes field, large number non-overlapping small for CFD-like problems, smaller larger with significant which can be met sparse linear algebra solvers.

参考文章(18)

George Bosilca, None, Dense Linear Algebra on Distributed Heterogeneous Hardware with a Symbolic DAG Approach Scalable Computing and Communications: Theory and Practice (to appear). ,(2012)

Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Florent Lopez, Multifrontal QR Factorization for Multicore Architectures over Runtime Systems Euro-Par 2013 Parallel Processing. ,vol. 8097, pp. 521- 532 ,(2013) , 10.1007/978-3-642-40047-6_53

Enrique S. Quintana-Ortí, Gregorio Quintana-Ortí, Jesús Labarta, José R. Herrero, Josep M. Pérez, Rosa M. Badia, Parallelizing dense and banded linear algebra libraries using SMPSs Concurrency and Computation: Practice and Experience. ,vol. 21, pp. 2438- 2456 ,(2009) , 10.1002/CPE.V21:18

G. R. Markall, A. Slemmer, D. A. Ham, P. H. J. Kelly, C. D. Cantwell, S. J. Sherwin, Finite element assembly strategies on multi-core and many-core architectures International Journal for Numerical Methods in Fluids. ,vol. 71, pp. 80- 97 ,(2013) , 10.1002/FLD.3648

Patrick R. Amestoy, Abdou Guermouche, Jean-Yves L’Excellent, Stéphane Pralet, Hybrid scheduling for the parallel solution of linear systems parallel computing. ,vol. 32, pp. 136- 156 ,(2006) , 10.1016/J.PARCO.2005.07.004

Gregorio Quintana-Ortí, Enrique S. Quintana-Ortí, Robert A. Van De Geijn, Field G. Van Zee, Ernie Chan, Programming matrix algorithms-by-blocks for thread-level parallelism ACM Transactions on Mathematical Software. ,vol. 36, pp. 1- 26 ,(2009) , 10.1145/1527286.1527288

Peter Huthwaite, Accelerated finite element elastodynamic simulations using the GPU Journal of Computational Physics. ,vol. 257, pp. 687- 707 ,(2014) , 10.1016/J.JCP.2013.10.017

Emmanuel Agullo, Jim Demmel, Jack Dongarra, Bilel Hadri, Jakub Kurzak, Julien Langou, Hatem Ltaief, Piotr Luszczek, Stanimire Tomov, Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects Journal of Physics: Conference Series. ,vol. 180, pp. 012037- ,(2009) , 10.1088/1742-6596/180/1/012037

Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Eric Darve, Matthias Messner, Toru Takahashi, Task-Based FMM for Multicore Architectures SIAM Journal on Scientific Computing. ,vol. 36, pp. 33- ,(2014) , 10.1137/130915662

10.

Nina Hanzlikova, Eduardo Rocha Rodrigues, A novel finite element method assembler for co-processors and accelerators irregular applications: architectures and algorithms. pp. 1- 8 ,(2013) , 10.1145/2535753.2535754

Assembly Operations for Multicore Architectures using Task-Based Runtime Systems

来源期刊

我的账户

Assembly Operations for Multicore Architectures using Task-Based Runtime Systems

来源期刊

相似文章 4

Design of Generic Modular Solutions for PDE solvers for Modern Architectures

An approach to the parallel assembly of the stiffness matrix in elastoplastic problems

Vectorization of a spectral finite-element numerical kernel

Iteratively solving sparse linear system based on PaRSEC task scheduling

我的账户