DASH: A C++ PGAS Library for Distributed Data Structures and Parallel Algorithms

作者: Tobias Fuchs , Roger Kowalewski , Karl Fürlinger

DOI:

关键词: Computer scienceParallel algorithmParallel computingData structureStandard Template LibraryBenchmark (computing)LocalityScalabilityPartitioned global address spaceDash

摘要: We present DASH, a C++ template library that offers distributed data structures and parallel algorithms implements compiler-free PGAS (partitioned global address space) approach. DASH many productivity performance features such as global-view structures, efficient support for the owner-computes model, flexible multidimensional distribution schemes inter-operability with STL (standard library) algorithms. also representation of target machine allows exploitation several hierarchically organized levels locality through concept Teams. evaluate on number benchmark applications we port scientific proxy application using MPI two-sided model to DASH. find excellent demonstrate scalability up 9800 cores.

参考文章(25)
Amir Kamil, Katherine Yelick, Hierarchical computation in the SPMD programming model languages and compilers for parallel computing. ,vol. 8664, pp. 3- 19 ,(2013) , 10.1007/978-3-319-09967-5_1
Yonghong Yan, Jisheng Zhao, Yi Guo, Vivek Sarkar, Hierarchical place trees: a portable abstraction for task parallelism and data movement languages and compilers for parallel computing. ,vol. 5898, pp. 172- 187 ,(2009) , 10.1007/978-3-642-13374-9_12
Roberto Belli, Torsten Hoefler, Notified Access: Extending Remote Memory Access Programming Models for Producer-Consumer Synchronization international parallel and distributed processing symposium. pp. 871- 881 ,(2015) , 10.1109/IPDPS.2015.30
Lei Zhou, Karl Fuerlinger, DART-CUDA: A PGAS Runtime System for Multi-GPU Systems 2015 14th International Symposium on Parallel and Distributed Computing. pp. 110- 119 ,(2015) , 10.1109/ISPDC.2015.20
Jarek Nieplocha, Bryan Carpenter, ARMCI: A Portable Remote Memory Copy Libray for Ditributed Array Libraries and Compiler Run-Time Systems international parallel processing symposium. pp. 533- 546 ,(1999) , 10.1007/BFB0097937
Antal Buss, Nancy M. Amato, Lawrence Rauchwerger, Harshvardhan, Ioannis Papadopoulos, Olga Pearce, Timmie Smith, Gabriel Tanase, Nathan Thomas, Xiabing Xu, Mauro Bianco, STAPL Proceedings of the 3rd Annual Haifa Experimental Systems Conference on - SYSTOR '10. pp. 14- ,(2010) , 10.1145/1815695.1815713
John Mellor-Crummey, Laksono Adhianto, William N. Scherer, Guohua Jin, A new vision for coarray Fortran Proceedings of the Third Conference on Partitioned Global Address Space Programing Models - PGAS '09. pp. 5- ,(2009) , 10.1145/1809961.1809969
Franois Broquedis, Jerome Clet-Ortega, Stephanie Moreaud, Nathalie Furmento, Brice Goglin, Guillaume Mercier, Samuel Thibault, Raymond Namyst, hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications parallel, distributed and network-based processing. pp. 180- 186 ,(2010) , 10.1109/PDP.2010.67
Barbara Chapman, Tony Curtis, Swaroop Pophale, Stephen Poole, Jeff Kuehn, Chuck Koelbel, Lauren Smith, Introducing OpenSHMEM Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model - PGAS '10. pp. 2- ,(2010) , 10.1145/2020373.2020375
Kathy Yelick, Luigi Semenzato, Geoff Pike, Carleton Miyamoto, Ben Liblit, Arvind Krishnamurthy, Paul Hilfinger, Susan Graham, David Gay, Phil Colella, Alex Aiken, None, Titanium: a high-performance Java dialect Concurrency and Computation: Practice and Experience. ,vol. 10, pp. 825- 836 ,(1998) , 10.1002/(SICI)1096-9128(199809/11)10:11/13<825::AID-CPE383>3.0.CO;2-H