A Hybrid MPI-OpenMP Implementation of an Implicit Finite-Element Code on Parallel Architectures

Authors: G. Mahinthakumar, F. Saied

DOI: 10.1177/109434200201600402

Keywords: LOOP (programming language), Process (computing), Parallel computing, Decomposition (computer science), Computer science, Distributed memory, Code (cryptography), Message Passing Interface, Finite element code, Multiprocessing

Abstract: The hybrid MPI-OpenMP model is a natural parallel programming paradigm for emerging architectures that are based on symmetric multiprocessor (SMP) clusters. This paper presents a hybrid implementation adapted for an implicit finite-element code developed for groundwater transport simulations. The original code was parallelized for distributed memory architectures using MPI (Message Passing Interface) with a domain decomposition strategy. OpenMP directives were then added to the code (a straightforward loop-level implementation) to use multiple threads within each MPI process. To improve the OpenMP performance, several loop modifications were adopted. The parallel performance results are compared for four modern architectures. The results show that, for most of the cases tested, the pure MPI approach outperforms the hybrid model. The exceptions to this observation were mainly due to a limitation in the MPI library implementation on one of the architectures. A general conclusion ...

References (10)
George Karypis, Ananth Grama, Vipin Kumar, Anshul Gupta, Introduction to Parallel Computing: Design and Analysis of Algorithms. Benjamin-Cummings Publishing Co., Inc. (1994)
Steve W. Bova, Clay P. Breshears, Christine E. Cuicchi, Zeki Demirbilek, Henry A. Gabb, Dual-Level Parallel Analysis of Harbor Wave Response Using MPI and OpenMP. IEEE International Conference on High Performance Computing, Data, and Analytics, vol. 14, pp. 49-64 (2000), 10.1177/109434200001400104
Alan J. Wallcraft, SPMD OpenMP versus MPI for ocean models. Concurrency and Computation: Practice and Experience, vol. 12, pp. 1155-1164 (2000), 10.1002/1096-9128(200010)12:12<1155::AID-CPE532>3.0.CO;2-5
Glenn R. Luecke, Wei-Hua Lin, Scalability and performance of OpenMP and MPI on a 128-processor SGI Origin 2000. Concurrency and Computation: Practice and Experience, vol. 13, pp. 905-928 (2001), 10.1002/CPE.588
Lorna Smith, Paul Kent, Development and performance of a mixed OpenMP/MPI quantum Monte Carlo code. Concurrency and Computation: Practice and Experience, vol. 12, pp. 1121-1129 (2000), 10.1002/1096-9128(200010)12:12<1121::AID-CPE531>3.0.CO;2-N
Jay Hoeflinger, Prasad Alavilli, Thomas Jackson, Bob Kuhn, Producing scalable performance with OpenMP: experiments with two CFD applications. Parallel Computing, vol. 27, pp. 391-413 (2001), 10.1016/S0167-8191(00)00071-5
G. Mahinthakumar, J. P. Gwo, Gerilynn R. Moline, Oren F. Webb, Subsurface Biological Activity Zone Detection Using Genetic Search Algorithms. Journal of Environmental Engineering, vol. 125, pp. 1103-1112 (1999), 10.1061/(ASCE)0733-9372(1999)125:12(1103)
Kamal C. Sarma, Hojjat Adeli, Bilevel Parallel Genetic Algorithms for Optimization of Large Steel Structures. Computer-Aided Civil and Infrastructure Engineering, vol. 16, pp. 295-304 (2001), 10.1111/0885-9507.00234
David Mackay, G. Mahinthakumar, Ed D'Azevedo, A Study of I/O in a Parallel Finite Element Groundwater Transport Code. IEEE International Conference on High Performance Computing, Data, and Analytics, vol. 12, pp. 307-319 (1998), 10.1177/109434209801200301