Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems

作者: Gábor Dózsa , Sameer Kumar , Pavan Balaji , Darius Buntinas , David Goodell

DOI: 10.1007/978-3-642-15646-5_2

关键词: Task (computing)Multi-core processorMessage queueNetwork interfaceNode (networking)Parallel computingShared memoryCritical sectionPetascale computingComputer science

摘要: With the ever-increasing numbers of cores per node on HPC systems, applications are increasingly using threads to exploit shared memory within a node, combined with MPI across nodes. Achieving high performance when large number concurrent make calls is challenging task for an implementation. We describe design and implementation our solution in MPICH2 achieve high-performance multithreaded communication IBM Blue Gene/P. use combination multichannel-enabled network interface, fine-grained locks, lock-free atomic operations, specially designed queues provide degree access while still maintaining MPI's message-ordering semantics. present results that demonstrate new improves message rate by factor 3.6 compared existing BG/P. Our solutions also applicable other high-end systems have parallel capabilities.

参考文章(7)
Jin Haopiang, Rob F. vanderWijngaart, NAS Parallel Benchmarks, Multi-Zone Versions ,(2003)
Sameer Kumar, Joseph Ratterman, Brian Smith, Charles J. Archer, Gabor Dozsa, Gheorghe Almasi, Philip Heidelberger, Dong Chen, Mark E. Giampapa, Michael Blocksome, Ahmad Faraj, Jeff Parker, The deep computing messaging framework Proceedings of the 22nd annual international conference on Supercomputing - ICS '08. pp. 94- 103 ,(2008) , 10.1145/1375527.1375544
D.H. Bailey, E. Barszcz, J.T. Barton, D.S. Browning, R.L. Carter, L. Dagum, R.A. Fatoohi, P.O. Frederickson, T.A. Lasinski, R.S. Schreiber, H.D. Simon, V. Venkatakrishnan, S.K. Weeratunga, The Nas Parallel Benchmarks ieee international conference on high performance computing data and analytics. ,vol. 5, pp. 63- 73 ,(1991) , 10.1177/109434209100500306
William Gropp, Rajeev Thakur, Thread-safety in an MPI implementation: Requirements and analysis parallel computing. ,vol. 33, pp. 595- 604 ,(2007) , 10.1016/J.PARCO.2007.07.002
Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Rajeev Thakur, Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming ieee international conference on high performance computing data and analytics. ,vol. 24, pp. 49- 57 ,(2010) , 10.1177/1094342009360206
Rajeev Thakur, William Gropp, Test suite for evaluating performance of multithreaded MPI communication parallel computing. ,vol. 35, pp. 608- 617 ,(2009) , 10.1016/J.PARCO.2008.12.013