作者: John Linford , Tyler A. Simon , Sameer Shende , Allen D. Malony
DOI: 10.1007/978-3-319-05215-1_8
关键词:
摘要: The recent development of a unified SHMEM framework, OpenSHMEM, has enabled further study in the porting and scaling applications that can benefit from programming model. This paper focuses on non-numerical graph algorithms, which typically have low FLOPS/byte ratio. An overview space time complexity Kruskal's Prim's algorithms for generating minimum spanning tree (MST) is presented, along with an implementation algorithm uses OpenSHEM to generate MST parallel without intermediate communication. Additionally, procedure applying TAU Performance System OpenSHMEM produce indepth performance profiles showing spent code regions, memory access patterns, network load presented. evaluations Cray XK7 "Titan" system at Oak Ridge National Laboratory 48 core shared University Maryland, Baltimore County are provided.