Performance characterization of the NAS Parallel Benchmarks in OpenCL

作者: Sangmin Seo , Gangwon Jo , Jaejin Lee

DOI: 10.1109/IISWC.2011.6114174

关键词:

摘要: … on CPUs and GPUs, we enhance the current NPB suite and our characterization will give new … Figure 3 (a) shows the speedups for the Class B problem size and (b) for Class C. For MG, …

参考文章(19)
David A. Patterson, Samuel Webb Williams, Auto-tuning performance on multicore computers University of California at Berkeley. ,(2008)
Jin Haopiang, Rob F. vanderWijngaart, NAS Parallel Benchmarks, Multi-Zone Versions ,(2003)
Michael Frumkin, Jerry Yan, Hao-Qiang Jin, The OpenMP Implementation of NAS Parallel Benchmarks and its Performance ,(2013)
Shane Ryoo, Christopher I. Rodrigues, Sara S. Baghsorkhi, Sam S. Stone, David B. Kirk, Wen-mei W. Hwu, Optimization principles and application performance evaluation of a multithreaded GPU using CUDA Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming - PPoPP '08. pp. 73- 82 ,(2008) , 10.1145/1345206.1345220
S. J. Pennycook, S. D. Hammond, S. A. Jarvis, G. R. Mudalige, Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark ACM SIGMETRICS Performance Evaluation Review. ,vol. 38, pp. 23- 29 ,(2011) , 10.1145/1964218.1964223
Jayanth Gummaraju, Laurent Morichetti, Michael Houston, Ben Sander, Benedict R. Gaster, Bixia Zheng, Twin peaks: a software platform for heterogeneous computing on general-purpose and graphics processors international conference on parallel architectures and compilation techniques. pp. 205- 216 ,(2010) , 10.1145/1854273.1854302
Jaejin Lee, Seung Hak Lee, Seung Mo Cho, Hyo Jung Song, Sang-Bum Suh, Jong-Deok Choi, Jungwon Kim, Sangmin Seo, Seungkyun Kim, Jungho Park, Honggyu Kim, Thanh Tuan Dao, Yongjin Cho, Sung Jong Seo, An OpenCL framework for heterogeneous multicores with local memory international conference on parallel architectures and compilation techniques. pp. 193- 204 ,(2010) , 10.1145/1854273.1854301
Frederick C. Wong, Richard P. Martin, Remzi H. Arpaci-Dusseau, David E. Culler, Architectural Requirements and Scalability of the NAS Parallel Benchmarks conference on high performance computing (supercomputing). pp. 41- 41 ,(1999) , 10.1145/331532.331573
Shuai Che, Jeremy W. Sheaffer, Michael Boyer, Lukasz G. Szafaryn, Liang Wang, Kevin Skadron, A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads ieee international symposium on workload characterization. pp. 1- 11 ,(2010) , 10.1109/IISWC.2010.5650274
Shuai Che, Michael Boyer, Jiayuan Meng, David Tarjan, Jeremy W. Sheaffer, Sang-Ha Lee, Kevin Skadron, Rodinia: A benchmark suite for heterogeneous computing ieee international symposium on workload characterization. pp. 44- 54 ,(2009) , 10.1109/IISWC.2009.5306797