Lessons Learned from Optimizing Science Kernels for Intel's "Knights Corner"' Architecture

作者: Jack Deslippe , Brian Austin , Chris Daley , Woo-Sun Yang

DOI: 10.1109/MCSE.2015.28

关键词: PATH (variable)ArchitectureSupercomputerOperating systemXeon PhiComputer scienceComputer architecture

摘要: Optimizing the codes and kernels representing National Energy Research Scientific Computing Center's workload on Knights Corner architecture helped pave path for NERSC's newest machine. Cori will use next generation of Intel Xeon Phi processors: Landing.

参考文章(18)
Larry Meadows, Experiments with WRF on intel® many integrated core (intel MIC) architecture international workshop on openmp. pp. 130- 139 ,(2012) , 10.1007/978-3-642-30961-8_10
William L Briggs, A multigrid tutorial ,(1987)
Henk J. Sips, Yonggang Che, Jianbin Fang, Chuanfu Xu, Lilun Zhang, Ana Lucia Varbanescu, An Empirical Study of Intel Xeon Phi arXiv: Distributed, Parallel, and Cluster Computing. ,(2013)
Iain Bethune, Fiona Reid, Optimising CP2K for the Intel Xeon Phi ,(2013)
Michael F. Wehner, Leonid Oliker, John Shalf, David Donofrio, Leroy A. Drummond, Ross Heikes, Shoaib Kamil, Celal Kono, Norman Miller, Hiroaki Miura, Marghoob Mohiyuddin, David Randall, Woo-Sun Yang, Hardware/software co‐design of global cloud system resolving models Journal of Advances in Modeling Earth Systems. ,vol. 3, ,(2011) , 10.1029/2011MS000073
Qiang Wu, Canqun Yang, Tao Tang, Liquan Xiao, MIC acceleration of short-range molecular dynamics simulations Proceedings of the First International Workshop on Code OptimiSation for MultI and many Cores - COSMIC '13. pp. 2- ,(2013) , 10.1145/2446920.2446922
Xing Liu, Edmond Chow, Large-Scale Hydrodynamic Brownian Simulations on Multicore and Manycore Architectures international parallel and distributed processing symposium. pp. 563- 572 ,(2014) , 10.1109/IPDPS.2014.65
Pawel Gepner, Victor Gamayunov, David L. Fraser, Eric Houdard, Ludovic Sauge, Damien Declat, Mathieu Dubois, Evaluation of DGEMM Implementation on Intel Xeon Phi Coprocessor Journal of Computers. ,vol. 9, pp. 1566- 1571 ,(2014) , 10.4304/JCP.9.7.1566-1571
William L. Briggs, Van Emden Henson, Steve F. McCormick, A multigrid tutorial: second edition Society for Industrial and Applied Mathematics. ,(2000) , 10.1137/1.9780898719505
Subhash Saini, Haoqiang Jin, Dennis Jespersen, Huiyu Feng, Jahed Djomehri, William Arasin, Robert Hood, Piyush Mehrotra, Rupak Biswas, An early performance evaluation of many integrated core architecture based SGI rackable computing system ieee international conference on high performance computing data and analytics. pp. 94- ,(2013) , 10.1145/2503210.2503272