Automatic performance debugging of SPMD-style parallel programs

作者: Xu Liu , Jianfeng Zhan , Kunlin Zhan , Weisong Shi , Lin Yuan

DOI: 10.1016/J.JPDC.2011.03.006

关键词:

摘要: Automatic performance debugging of parallel applications includes two main steps: locating bottlenecks and uncovering their root causes for optimization. Previous work fails to resolve this challenging issue in ways: first, several previous efforts automate bottlenecks, but present results a confined way that only identifies problems with priori knowledge; second, tools take exploratory or confirmatory data analysis automatically discover relevant relationships, these do not focus on causes. The simple program multiple (SPMD) programming model is widely used both high computing Cloud computing. In paper, we design implement an innovative system, AutoAnalyzer, automates the process SPMD-style programs, including collection, behavior analysis, AutoAnalyzer unique terms features: without any prior knowledge, it locates uncovers optimization; lightweight size be collected analyzed. Our contributions are three-fold: propose effective clustering algorithms investigate existence cause dissimilarity code region disparity, respectively; meanwhile, searching locate bottlenecks; basis rough set theory, approach uncover third, cluster systems different configurations, use production applications, written Fortran 77, one open source code-MPIBZIP2 (http://compression.ca/mpibzip2/), C++, verify effectiveness correctness our methods. For three also experimental investigating effects metrics bottlenecks.

参考文章(55)
George Tzoumas, Elias Leontiadis, Vassilios V. Dimakopoulos, A portable C compiler for OpenMP V.2.0 ,(2005)
Bernd Mohr, Felix Wolf, KOJAK – A Tool Set for Automatic Performance Analysis of Parallel Programs european conference on parallel processing. pp. 1301- 1304 ,(2003) , 10.1007/978-3-540-45209-6_177
Jack Dongarra, Bernd Mohr, Shirley Moore, Felix Wolf, Automatic analysis of inefficiency patterns in parallel applications: Research Articles Concurrency and Computation: Practice and Experience. ,vol. 19, pp. 1481- 1496 ,(2007) , 10.1002/CPE.V19:11
Umberto Villano, Massimiliano Rak, Beniamino Di Martino, Emilio Mancini, Roberto Torella, Cluster systems and simulation: from benchmarking to off-line performance prediction: Research Articles Concurrency and Computation: Practice and Experience. ,vol. 19, pp. 1549- 1562 ,(2007) , 10.1002/CPE.V19:11
A. D. Malony, L. Li, Knowledge engineering for automatic parallel performance diagnosis: Research Articles Concurrency and Computation: Practice and Experience. ,vol. 19, pp. 1497- 1515 ,(2007) , 10.1002/CPE.V19:11
Jeffrey K. Hollingsworth, Michael Steele, Grindstone: A Test Suite for Parallel Performance Tools ,(1998)
John Mellor-Crummey, Robert J. Fowler, Gabriel Marin, Nathan Tallent, HPCVIEW: A Tool for Top-down Analysis of Node Performance The Journal of Supercomputing. ,vol. 23, pp. 81- 104 ,(2002) , 10.1023/A:1015789220266
Philip C. Roth, Barton P. Miller, Deep Start: A Hybrid Strategy for Automated Performance Problem Searches european conference on parallel processing. pp. 86- 96 ,(2002) , 10.1007/3-540-45706-2_9
Jianfeng Zhan, Bibo Tu, Ming Zou, Xu Liu, Dan Meng, Similarity Analysis in Automatic Performance Debugging of SPMD Parallel Programs arXiv: Distributed, Parallel, and Cluster Computing. ,(2009)