An evaluation of global address space languages

作者: Cristian Coarfa , Yuri Dotsenko , John Mellor-Crummey , François Cantonnet , Tarek El-Ghazawi

DOI: 10.1145/1065944.1065950

关键词:

摘要: Co-array Fortran (CAF) and Unified Parallel C (UPC) are two emerging languages for single-program, multiple-data global address space programming. These boost programmer productivity by providing shared variables inter-process communication instead of message passing. However, the performance these still has room improvement. In this paper, we study variants NAS MG, CG, SP, BT benchmarks on several modern architectures to identify challenges that must be met deliver top performance. We compare CAF UPC programs with original Fortran+MPI code. Today, scalable clusters only when written use bulk communication. our experiments uncovered some significant bottlenecks codes all platforms. account root causes limiting such as synchronization model, efficiency strided data, source-to-source translation issues. show they can remedied language extensions, new constructs, and, finally, adequate optimizations back-end compilers.

参考文章(12)
Dan Bonachea, P. Hargrove, GASNet Specification, v1.1 University of California at Berkeley. ,(2002) , 10.2172/1398512
Yuri Dotsenko, Cristian Coarfa, John Mellor-Crummey, Daniel Chavarría-Miranda, Experiences with co-array fortran on hardware shared memory platforms ieee international conference on high performance computing data and analytics. pp. 332- 347 ,(2004) , 10.1007/11532378_24
John Mellor-Crummey, Robert J. Fowler, Gabriel Marin, Nathan Tallent, HPCVIEW: A Tool for Top-down Analysis of Node Performance The Journal of Supercomputing. ,vol. 23, pp. 81- 104 ,(2002) , 10.1023/A:1015789220266
Cristian Coarfa, Yuri Dotsenko, Jason Eckhardt, John Mellor-Crummey, Co-array Fortran Performance and Potential: An NPB Experimental Study languages and compilers for parallel computing. pp. 177- 193 ,(2003) , 10.1007/978-3-540-24644-2_12
Jarek Nieplocha, Bryan Carpenter, ARMCI: A Portable Remote Memory Copy Libray for Ditributed Array Libraries and Compiler Run-Time Systems international parallel processing symposium. pp. 533- 546 ,(1999) , 10.1007/BFB0097937
Cristian Coarfa, Yuri Dotsenko, John Mellor-Crummey, A Multi-Platform Co-Array Fortran Compiler international conference on parallel architectures and compilation techniques. pp. 29- 40 ,(2004) , 10.5555/1025127.1025994
Cantonnet François, Yao Yiyi, Annareddy Smita, Mohamed Ahmed, El-Ghazawi Tarek, None, Performance monitoring and evaluation of a UPC implementation on a NUMA architecture international parallel and distributed processing symposium. pp. 274- ,(2003) , 10.1109/IPDPS.2003.1213492
Tarek El-Ghazawi, Francois Cantonnet, UPC Performance and Potential: A NPB Experimental Study conference on high performance computing (supercomputing). pp. 1- 26 ,(2002) , 10.5555/762761.762821
Wei-Yu Chen, Dan Bonachea, Jason Duell, Parry Husbands, Costin Iancu, Katherine Yelick, A performance analysis of the Berkeley UPC compiler international conference on supercomputing. pp. 63- 73 ,(2003) , 10.1145/782814.782825