Outcomes of the fault tolerance configuration

作者: Emilio Luque Fadón , Dolores Rexachs del Rosario , Angelo Duarte , Leonardo Fialho

DOI:

关键词:

摘要:

参考文章(7)
John Daly, A model for predicting the optimum checkpoint interval for restart dumps international conference on computational science. pp. 3- 12 ,(2003) , 10.1007/3-540-44864-0_1
Leonardo Fialho, Guna Santos, Angelo Duarte, Dolores Rexachs, Emilio Luque, None, Challenges and Issues of the Integration of RADIC into Open MPI european pvm mpi users group meeting on recent advances in parallel virtual machine and message passing interface. ,vol. 5759, pp. 73- 83 ,(2009) , 10.1007/978-3-642-03770-2_14
William Gropp, Ewing Lusk, Fault Tolerance in Message Passing Interface Programs ieee international conference on high performance computing data and analytics. ,vol. 18, pp. 363- 372 ,(2004) , 10.1177/1094342004046045
Camille Coti, Thomas Herault, Pierre Lemarinier, Laurence Pilard, Ala Rezmerita, Eric Rodriguez, Franck Cappello, Blocking vs. non-blocking coordinated checkpointing for large-scale fault tolerant MPI conference on high performance computing (supercomputing). pp. 127- ,(2006) , 10.1145/1188455.1188587
A.J. Oliner, R.K. Sahoo, J.E. Moreira, M. Gupta, Performance implications of periodic checkpointing on large-scale cluster systems international parallel and distributed processing symposium. pp. 299- ,(2005) , 10.1109/IPDPS.2005.337
Douglas Doerfler, Ron Brightwell, Measuring MPI Send and Receive Overhead and Application Availability in High Performance Network Interfaces Recent Advances in Parallel Virtual Machine and Message Passing Interface. pp. 331- 338 ,(2006) , 10.1007/11846802_46
A. Bouteiller, B. Collin, T. Herault, P. Lemarinier, F. Cappello, Impact of event logger on causal message logging protocols for fault tolerant MPI international parallel and distributed processing symposium. pp. 97- 97 ,(2005) , 10.1109/IPDPS.2005.249