Performance Evaluation of Hybrid Parallel Programming Paradigms

作者: Achal Prabhakar , Vladimir Getov

DOI: 10.1007/978-1-4615-0361-3_4

关键词: Parallel computingDistributed computingSuiteHomogeneousKey (cryptography)Computer scienceIBMComputationSupercomputerHybrid programmingPerformance results

摘要: With the trend in supercomputing world shifting from homogeneous machine architectures to hybrid clusters of SMP nodes, interoperabiility OpenMP and MPI has become a key issue understanding optimizing overall system performance. While low-level performance can be evaluated using existing benchmarks, combination two poses new challenges. Therefore, study different programming paradigms is high benefit for both vendors user community. As part our project, we have identified several possible combinations models order provide qualitative quantitative justification situations which any one them favoured. Collective operations are particularly important analyze evaluate on platform therefore concentrate three -- barrier, all-to-all, all-reduce. Issues like optimal mix MPI, most efficient way managing communication within OpenMP, unit communication, degree overlap between computation need evaluated. The results supporting this investigation were taken IBM Power-3 at San Diego Supecomputer Center suite microbenchmarks.

参考文章(8)
Ralf Reussner, Peter Sanders, Lutz Prechelt, Matthias Müller, SKaMPI: A Detailed, Accurate MPI Benchmark european pvm mpi users group meeting on recent advances in parallel virtual machine and message passing interface. pp. 52- 59 ,(1998) , 10.1007/BFB0056559
Lutz Prechelt, Peter Sanders, Ralf Reussner, Matthias Müller, SKaMPI: A Detailed, Accurate MPI Benchmark Untitled Event. pp. 52- 59 ,(1998)
Géraud Krawezik, Guillaume Alléon, Franck Cappello, SPMD OpenMP versus MPI on a IBM SMP for 3 Kernels of the NAS Benchmarks ieee international conference on high performance computing data and analytics. pp. 425- 436 ,(2002) , 10.1007/3-540-47847-7_39
Lorna Smith, Paul Kent, Development and performance of a mixed OpenMP/MPI quantum Monte Carlo code Concurrency and Computation: Practice and Experience. ,vol. 12, pp. 1121- 1129 ,(2000) , 10.1002/1096-9128(200010)12:12<1121::AID-CPE531>3.0.CO;2-N
F. Darema, D.A. George, V.A. Norton, G.F. Pfister, A single-program-multiple-data computational model for EPEX/FORTRAN parallel computing. ,vol. 7, pp. 11- 24 ,(1988) , 10.1016/0167-8191(88)90094-4
Thomas Fahringer, Matthew Haines, Piyush Mehrotra, On the utility of threads for data parallel programming Proceedings of the 9th international conference on Supercomputing - ICS '95. pp. 51- 59 ,(1995) , 10.1145/224538.224544
Michael J. Quinn, Philip J. Hatcher, On the Utility of Communication-Computation Overlap in Data-Parallel Programs Journal of Parallel and Distributed Computing. ,vol. 33, pp. 197- 204 ,(1996) , 10.1006/JPDC.1996.0038
Leslie Lamport, Time, clocks, and the ordering of events in a distributed system Communications of the ACM. ,vol. 21, pp. 558- 565 ,(1978) , 10.1145/359545.359563