APEnet+: a 3D Torus network optimized for GPU-based HPC Systems

作者: R Ammendola , A Biagioni , O Frezza , F Lo Cicero , A Lonardo

DOI: 10.1088/1742-6596/396/4/042059

关键词: Grid networkSupercomputerRemote direct memory accessEmbedded systemPCI ExpressNetwork interface controllerInterconnectionField-programmable gate arrayComputer scienceData transmission

摘要: In the supercomputing arena, strong rise of GPU-accelerated clusters is a matter fact. Within INFN, we proposed an initiative — QUonG project whose aim to deploy high performance computing system dedicated scientific computations leveraging on commodity multi-core processors coupled with latest generation GPUs. The inter-node interconnection based point-to-point, performance, low latency 3D torus network which built in framework APEnet+ project. It takes form FPGA-based PCIe card exposing six full bidirectional links running at 34 Gbps each that implements RDMA protocol. order enable significant access reduction for data transfer, direct network-to-GPU interface was built. specialized hardware blocks, integrated board, provide support GPU-initiated communications using so called peer-to-peer (P2P) transactions. This development made close collaboration GPU vendor NVIDIA. final shape complete deployment assembly standard 42U racks, one capable 80 TFLOPS/rack peak cost 5 k€/T F LOPS and estimated power consumption 25 kW/rack. this paper report status rack R&D activities 2012 will focus enhancement through adoption new 28 nm FPGAs allowing implementation Gen3 host addition fault tolerance-oriented capabilities.

参考文章(15)
Roberto Ammendola, Roberto Petronzio, Davide Rossetti, Andrea Salamon, Nazario Tantalo, Piero Vicini, Status of the APENet project Proceedings of XXIIIrd International Symposium on Lattice Field Theory — PoS(LAT2005). ,(2005) , 10.22323/1.020.0100
R. Ammendola, M. Guagnelli, G. Mazza, F. Palombi, R. Petronzio, D. Rossetti, A. Salamon, P. Vicini, APENet: LQCD clusters a la APE Nuclear Physics B - Proceedings Supplements. ,vol. 140, pp. 826- 828 ,(2005) , 10.1016/J.NUCLPHYSBPS.2004.11.373
R Ammendola, A Biagioni, O Prezza, F Lo Cicero, A Lonardo, P S Paolucci, D Rossetti, A Salamon, G Salina, F Simula, L Tosoratto, P Vicini, APEnet+: high bandwidth 3D torus direct network for petaflops scale commodity clusters arXiv: Computational Physics. ,(2011) , 10.1088/1742-6596/331/5/052029
D. Chen, N.H. Christ, C. Cristian, Z. Dong, A. Gara, K. Garg, B. Joo, C. Kim, L. Levkova, X. Liao, R.D. Mawhinney, S. Ohta, T. Wettig, QCDOC: A 10-teraflops scale computer for lattice QCD 18th International Symposium on Lattice Field Theory Lattice 2000, Bangalore (IN), 08/17/2000--08/22/2000. ,vol. 94, pp. 825- 832 ,(2001) , 10.1016/S0920-5632(01)01014-3
M. Albanese, P. Bacilieri, S. Cabasino, N. Cabibbo, F. Costantini, G. Fiorentini, F. Flore, A. Fonti, A. Fucci, M.P. Lombardo, S. Galeotti, P. Giacomelli, P. Marchesini, E. Marinari, F. Marzano, A. Miotto, P. Paolucci, G. Parisi, D. Pascoli, D. Passuello, S. Petrarca, F. Rapuano, E. Remiddi, R. Rusack, G. Salina, R. Tripiccione, The APE computer: An array processor optimized for lattice gauge theory simulations Computer Physics Communications. ,vol. 45, pp. 345- 353 ,(1987) , 10.1016/0010-4655(87)90172-X
P. A. Boyle, D. Chen, N. H. Christ, M. A. Clark, S. D. Cohen, C. Cristian, Z. Dong, A. Gara, B. Joo, C. Jung, C. Kim, L. A. Levkova, X. Liao, G. Liu, R. D. Mawhinney, S. Ohta, K. Petrov, T. Wettig, A. Yamaguchi, Overview of the QCDSP and QCDOC computers Ibm Journal of Research and Development. ,vol. 49, pp. 351- 365 ,(2005) , 10.1147/RD.492.0351
A Ukawa, T Yoshié, N Ishizuka, Y Iwasaki, G Boyd, S Aoki, K Kanaya, Y Kuramashi, Masanori Okawa, T Kaneko, R Burkhalter, S Hashimoto, Full QCD simulation on CP-PACS Nuclear Physics B Proceedings Supplements. ,vol. 60, pp. 335- 340 ,(1998) , 10.1016/S0920-5632(97)00494-5
A. Bartoloni, S. Cabasino, M. Cosimi, P. De Riso, A. Lonardo, A. Michelotti, E. Panizzi, P.S. Paolucci, D. Rossetti, M. Torelli, P. Vicini, N. Cabibbo, W. Errico, S. Giovannetti, F. Laico, G. Magazzú, R. Tripiccione, H. Simma, An overview of the APEmille project Nuclear Physics B - Proceedings Supplements. ,vol. 60, pp. 237- 240 ,(1998) , 10.1016/S0920-5632(97)00485-4
S. Cabasino, N. Cabibbo, L.A. Fernández, G. Fiorentini, A. Lai, M.P. Lombardo, E. Marinari, F. Marzano, P. Paolucci, G. Parisi, J. Pech, F. Rapuano, E. Remiddi, R. Sarno, G. Salina, A. Tarancón, G.M. Todesco, M. Torelli, R. Tripiccione, W. Tross, N. Avico, P. Bacilieri, From APE to APE-100: From 1 to 100 gflops in lattice gauge theory simulations Computer Physics Communications. ,vol. 57, pp. 285- 289 ,(1989) , 10.1016/0010-4655(89)90229-4
Roberto Petronzio, Francesca Lo Cicero, Pier Stanislao Paolucci, Nazario Tantalo, Ottorino Frezza, Alessandro Lonardo, Andrea Biagioni, Roberto Ammendola, Francesco Simula, Laura Tosoratto, Gaetano Salina, Piero Vicini, Andrea Salamon, Davide Rossetti, APEnet+: a 3D toroidal network enabling Petaflops scale Lattice QCD simulations on commodity clusters Proceedings of The XXVIII International Symposium on Lattice Field Theory. June 14-19. pp. 022- ,(2010)