Framework and Modular Infrastructure for Automation of Architectural Adaptation and Performance Optimization for HPC Systems

作者: Leonardo Fialho , James Browne

DOI: 10.1007/978-3-319-07518-1_17

关键词: WorkflowModular designSource codeDistributed computingAdaptation (computer science)Operating systemCode segmentXeon PhiImplementationAutomationComputer science

摘要: High performance systems have complex, diverse and rapidly evolving architectures. The span of applications, workloads, resource use patterns is diversifying. Adapting applications for efficient execution on this spectrum environments effort intensive. There are many optimization tools which implement some or several aspects the full task but almost none comprehensive across architectures, environments, workloads. This paper presents, illustrates, applies a modular infrastructure enables composition multiple open-source analyses into set workflows implementing end-to-end HPC architectures types parallel environments. It gives results from an implementation Stampede system at Texas Advanced Computing Center where user can submit application using only single command line get back least, partially optimized program without manual modification two different chips. Currently, subset possible optimizations completely automated growing. Case studies workflow presented. implementations currently available download as PerfExpert tool version 4.0 supports both Sandy Bridge Intel Phi

参考文章(28)
Renato Miceli, Gilles Civario, Anna Sikora, Eduardo César, Michael Gerndt, Houssam Haitof, Carmen Navarrete, Siegfried Benkner, Martin Sandrieser, Laurent Morin, François Bodin, AutoTune: a plugin-driven approach to the automatic tuning of parallel applications parallel computing. pp. 328- 342 ,(2012) , 10.1007/978-3-642-36803-5_24
Markus Schordan, Dan Quinlan, A Source-To-Source Architecture for User-Defined Optimizations Lecture Notes in Computer Science. pp. 214- 223 ,(2003) , 10.1007/978-3-540-45213-3_27
Ravi Sethi, Stephen C. Johnson, Yacc: a parser generator UNIX Vol. II. pp. 347- 374 ,(1990)
Nicholas Nethercote, Julian Seward, Valgrind: A Program Supervision Framework Electronic Notes in Theoretical Computer Science. ,vol. 89, pp. 44- 66 ,(2003) , 10.1016/S1571-0661(04)81042-9
Dieter an Mey, Scott Biersdorf, Christian Bischof, Kai Diethelm, Dominic Eschweiler, Michael Gerndt, Andreas Knüpfer, Daniel Lorenz, Allen Malony, Wolfgang E. Nagel, Yury Oleynik, Christian Rössel, Pavel Saviankou, Dirk Schmidl, Sameer Shende, Michael Wagner, Bert Wesarg, Felix Wolf, Score-P: A Unified Performance Measurement System for Petascale Applications CHPC. pp. 85- 97 ,(2011) , 10.1007/978-3-642-24025-6_8
Seon Wook Kim, Insung Park, Rudolf Eigenmann, A Performance Advisor Tool for Shared-Memory Parallel Programming languages and compilers for parallel computing. pp. 274- 288 ,(2000) , 10.1007/3-540-45574-4_18
Zhelong Pan, Brian Armstrong, Hansang Bae, Rudolf Eigenmann, On the interaction of tiling and automatic parallelization international workshop on openmp. pp. 24- 35 ,(2005) , 10.1007/978-3-540-68555-5_3
Shirley Browne, Jack Dongarra, Nathan Garner, George Ho, Philip Mucci, A Portable Programming Interface for Performance Evaluation on Modern Processors ieee international conference on high performance computing data and analytics. ,vol. 14, pp. 189- 204 ,(2000) , 10.1177/109434200001400303
Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris, Knowledge support and automation for performance analysis with PerfExplorer 2.0 Scientific Programming. ,vol. 16, pp. 123- 134 ,(2008) , 10.1155/2008/985194
Rudolf Eigenmann, Toward a methodology of optimizing programs for high-performance computers Proceedings of the 7th international conference on Supercomputing - ICS '93. pp. 27- 36 ,(1993) , 10.1145/165939.165948