Performance Evaluation of Unified Parallel C Collective Communications

Authors: Guillermo L. Taboada, Carlos Teijeiro, Juan Tourino, Basilio B. Fraguela, Ramón Doallo

DOI: 10.1109/HPCC.2009.88

Abstract: Unified Parallel C (UPC) is an extension of ANSI C designed for parallel programming. UPC collective primitives, which are part of the UPC standard, increase programming productivity while reducing communication overhead. This paper presents an up-to-date performance evaluation of two publicly available UPC collective implementations on three scenarios: shared, distributed, and hybrid shared/distributed memory architectures. Characterizing the throughput of the collective primitives is useful for increasing performance through runtime selection of the appropriate primitive implementation, which depends on the message size and the memory architecture, as well as for detecting inefficient implementations. In fact, based on the analysis of UPC collectives performance, we proposed some optimizations for the current UPC collective libraries. We have also compared the performance of the UPC collective primitives with their MPI counterparts, showing that there is room for improvement. Finally, this paper concludes with an analysis of the influence of UPC collectives performance on a representative communication-intensive application, showing that their optimization is highly important for UPC scalability.
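The runtime selection mentioned in the abstract can be pictured with a minimal C sketch. Everything in it is an assumption made for illustration: the names `arch_t`, `bcast_impl_t`, and `select_broadcast`, the 8 KiB threshold, and the decision rules are hypothetical and are not taken from the paper or from any UPC library. A real selector would derive its thresholds from measured throughput on each system.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical labels for the three memory scenarios named in the abstract. */
typedef enum { ARCH_SHARED, ARCH_DISTRIBUTED, ARCH_HYBRID } arch_t;

/* Hypothetical broadcast variants; not the names used by any real library. */
typedef enum { BCAST_FLAT, BCAST_TREE, BCAST_PIPELINED } bcast_impl_t;

/* Assumed cut-over point; in practice this would come from benchmarks. */
#define SMALL_MSG_BYTES ((size_t)8192)

/* Pick a broadcast variant from the architecture, message size, and
 * thread count. The rules below are illustrative placeholders only. */
bcast_impl_t select_broadcast(arch_t arch, size_t msg_bytes, int threads)
{
    assert(threads > 0);

    /* Within one shared-memory node, or with very few threads,
     * assume a flat copy from the root is good enough. */
    if (arch == ARCH_SHARED || threads <= 2) {
        return BCAST_FLAT;
    }
    /* Across nodes, assume small messages are latency-bound (tree)
     * and large messages are bandwidth-bound (pipelined). */
    if (msg_bytes <= SMALL_MSG_BYTES) {
        return BCAST_TREE;
    }
    return BCAST_PIPELINED;
}
```

Under these assumed rules, a call such as `select_broadcast(ARCH_HYBRID, 1 << 20, 8)` would pick the pipelined variant, while the same call on `ARCH_SHARED` would pick the flat one.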