Evaluating similarity-based trace reduction techniques for scalable performance analysis

作者: Kathryn Mohror , Karen L. Karavanic

DOI: 10.1145/1654059.1654115

关键词: Data miningSimilarity (geometry)Distributed computingTRACE (psycholinguistics)ScalabilityComputer scienceEvent (computing)Reduction (complexity)Volume (computing)

摘要: Event traces are required to correctly diagnose a number of performance problems that arise on today's highly parallel systems. Unfortunately, the collection event can produce large volume data is difficult, or even impossible, store and analyze. One approach for compressing trace identify repeating patterns retain only one representative each pattern. However, determining similarity sections traces, i.e., identifying patterns, not straightforward. In this paper, we investigate pattern-based methods reducing will be used analysis. We evaluate different against several criteria, including size reduction, introduced error, retention trends, using both benchmarks with carefully chosen behaviors, real application.

参考文章(38)
Jesús Labarta, Rosa M. Badia, Marc Casas, Automatic Phase Detection of MPI Applications. parallel computing. pp. 129- 136 ,(2007)
Kathryn Mohror, Karen L. Karavanic, Towards scalable event tracing for high end systems high performance computing and communications. pp. 695- 706 ,(2007) , 10.1007/978-3-540-75444-2_65
Andreas Knüpfer, A new data compression technique for event based program traces international conference on computational science. pp. 956- 965 ,(2003) , 10.1007/3-540-44863-2_94
Andreas Knüpfer, Wolfgang E. Nagel, Dieter Kranzlmüller, Pattern Matching of Collective MPI Operations. parallel and distributed processing techniques and applications. pp. 1243- 1249 ,(2004)
Laura Carrington, Allan Snavely, Xiaofeng Gao, Nicole Wolter, A performance prediction framework for scientific applications international conference on computational science. pp. 926- 935 ,(2003) , 10.1007/3-540-44863-2_91
Daved C. Bailey, Historia general de México Americas. ,vol. 58, pp. 481- 482 ,(1978) , 10.1215/00182168-58.3.481
Anders la Cour-Harbo, Arne Jensen, Ripples in Mathematics: The Discrete Wavelet Transform ,(2014)
Laxmikant V. Kalé, Sameer Kumar, Gengbin Zheng, Chee Wai Lee, Scaling molecular dynamics to 3000 processors with projections: a performance analysis case study international conference on computational science. pp. 23- 32 ,(2003) , 10.1007/3-540-44864-0_3
Michael Gerndt, Bernd Mohr, Jesper Larsson Träff, A test suite for parallel performance analysis tools Concurrency and Computation: Practice and Experience. ,vol. 19, pp. 1465- 1480 ,(2007) , 10.1002/CPE.1124
Prasun Ratn, Frank Mueller, Bronis R. de Supinski, Martin Schulz, Preserving time in large-scale communication traces Proceedings of the 22nd annual international conference on Supercomputing - ICS '08. pp. 46- 55 ,(2008) , 10.1145/1375527.1375537