Rodinia: A benchmark suite for heterogeneous computing

作者: Shuai Che , Michael Boyer , Jiayuan Meng , David Tarjan , Jeremy W. Sheaffer

DOI: 10.1109/IISWC.2009.5306797

关键词:

摘要: … supported only on NVIDIA GPUs, but recent work has shown that CUDA programs can be compiled to execute efficiently on multi-core CPUs [32]. The NVIDIA GTX 280 GPU used in this …

参考文章(22)
Pawan Harish, P. J. Narayanan, Accelerating Large Graph Algorithms on the GPU Using CUDA High Performance Computing – HiPC 2007. pp. 197- 208 ,(2007) , 10.1007/978-3-540-77220-0_21
Jiayuan Meng, Kevin Skadron, Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs Proceedings of the 23rd international conference on Conference on Supercomputing - ICS '09. pp. 256- 265 ,(2009) , 10.1145/1542275.1542313
Shane Ryoo, Christopher I. Rodrigues, Sara S. Baghsorkhi, Sam S. Stone, David B. Kirk, Wen-mei W. Hwu, Optimization principles and application performance evaluation of a multithreaded GPU using CUDA Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming - PPoPP '08. pp. 73- 82 ,(2008) , 10.1145/1345206.1345220
John Nickolls, Ian Buck, Michael Garland, Kevin Skadron, Scalable parallel programming with CUDA ACM SIGGRAPH 2008 classes on - SIGGRAPH '08. ,vol. 6, pp. 40- 53 ,(2008) , 10.1145/1401132.1401152
J. A. Kahle, M. N. Day, H. P. Hofstee, C. R. Johns, T. R. Maeurer, D. Shippy, Introduction to the cell multiprocessor Ibm Journal of Research and Development. ,vol. 49, pp. 589- 604 ,(2005) , 10.1147/RD.494.0589
Kenneth Hoste, Lieven Eeckhout, Microarchitecture-Independent Workload Characterization IEEE Micro. ,vol. 27, pp. 63- 72 ,(2007) , 10.1109/MM.2007.56
Ajay Joshi, Aashish Phansalkar, L. Eeckhout, L.K. John, Measuring benchmark similarity using inherent program characteristics IEEE Transactions on Computers. ,vol. 55, pp. 769- 782 ,(2006) , 10.1109/TC.2006.85
Shuai Che, Jie Li, Jeremy W. Sheaffer, Kevin Skadron, John Lach, Accelerating Compute-Intensive Applications with GPUs and FPGAs symposium on application specific processors. pp. 101- 107 ,(2008) , 10.1109/SASP.2008.4570793
Shubhabrata Sengupta, Mark Harris, Yao Zhang, John D Owens, None, Scan primitives for GPU computing international conference on computer graphics and interactive techniques. pp. 97- 106 ,(2007) , 10.5555/1280094.1280110