A performance study of general-purpose applications on graphics processors using CUDA

作者: Shuai Che , Michael Boyer , Jiayuan Meng , David Tarjan , Jeremy W. Sheaffer

DOI: 10.1016/J.JPDC.2008.05.014

关键词:

摘要: … All of our applications show satisfactory speedups, but the main contribution of our work … reduction using the OpenMP reduction pragma. The CUDA version performs a manual reduction …

参考文章(27)
Masumeh Damrudi, Kamal Jadidy Aval, Parallel sorting on ILLIAC array processor international conference on systems. pp. 260- 263 ,(2007)
Dave Baldwin, Randi Rost, John Kessenich, The OpenGL® Shading Language ,(2006)
Ethel I. Swail, N. D. Durie, Digest of papers Conference Committee. ,(1976)
Tom Goodale, Gabrielle Allen, Gerd Lanfermann, Joan Massó, Thomas Radke, Edward Seidel, John Shalf, The Cactus Framework and Toolkit: Design and Applications Lecture Notes in Computer Science. pp. 197- 227 ,(2003) , 10.1007/3-540-36569-9_13
Michael McCool, Stefanus Du Toit, Metaprogramming GPUs with Sh ,(2004)
George Vahala, Jonathan Carter, Min Soe, Linda Vahala, Jeffrey Yepez, 3D Entropic Lattice Boltzmann Simulations of 3D Navier-Stokes Turbulence Bulletin of the American Physical Society. ,vol. 47, ,(2005)
Christopher I. Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-Mei W. Hwu, GPU acceleration of cutoff pair potentials for molecular modeling applications Proceedings of the 2008 conference on Computing frontiers - CF '08. pp. 273- 282 ,(2008) , 10.1145/1366230.1366277
John Nickolls, Ian Buck, Michael Garland, Kevin Skadron, Scalable parallel programming with CUDA ACM SIGGRAPH 2008 classes on - SIGGRAPH '08. ,vol. 6, pp. 40- 53 ,(2008) , 10.1145/1401132.1401152
Ian Buck, Tim Foley, Daniel Horn, Jeremy Sugerman, Kayvon Fatahalian, Mike Houston, Pat Hanrahan, Brook for GPUs ACM Transactions on Graphics. ,vol. 23, pp. 777- 786 ,(2004) , 10.1145/1015706.1015800