Self-healing in binomial graph networks

作者: Thara Angskun , George Bosilca , Jack Dongarra

DOI: 10.1007/978-3-540-76890-6_30

关键词:

摘要: The number of processors embedded in high performance computing platforms is growing daily to solve larger and more complex problems. However, as the components increases, so does probability failure. logical network topologies must also support fault-tolerant capability such dynamic environments. This paper presents a self-healing mechanism improve Binomial graph (BMG) network. protects BMG from bisection helps maintain optimal routing even failure circumstances. experimental results show that with an adaptive method significantly reduces overhead reconstructing networks.

参考文章(35)
Hans Meuer, E. Strohmaier, J. Dongarra, Horst Simon, Top500 Supercomputer Sites University of Tennessee. ,(1997)
Marvin Theimer, Alec Wolman, Michael B. Jones, Stefan Saroiu, Nicholas J. A. Harvey, SkipNet: a scalable overlay network with practical locality properties usenix symposium on internet technologies and systems. pp. 9- 9 ,(2003)
John D. Kubiatowicz, Anthony D. Joseph, Ben Y. Zhao, Tapestry: An Infrastructure for Fault-tolerant Wide-area Location and University of California at Berkeley. ,(2001)
Thara Angskun, Graham E. Fagg, George Bosilca, Jelena Pješivac–Grbović, Jack J. Dongarra, Scalable Fault Tolerant Protocol for Parallel Runtime Environments Recent Advances in Parallel Virtual Machine and Message Passing Interface. pp. 141- 149 ,(2006) , 10.1007/11846802_25
Ben Yanbin Zhao, John Kubiatowicz, Anthony D Joseph, Tapestry: An Infrastructure for Fault-tolerant Wide-area Location and Routing ,(2001)
Dahlia Malkhi, Moni Naor, David Ratajczak, Viceroy Proceedings of the twenty-first annual symposium on Principles of distributed computing - PODC '02. pp. 183- 192 ,(2002) , 10.1145/571825.571857
Stuart Campbell, Mohan Kumar, Stephan Olariu, The hierarchical cliques interconnection network Journal of Parallel and Distributed Computing. ,vol. 64, pp. 16- 28 ,(2004) , 10.1016/J.JPDC.2003.08.005
S. Banerjee, D. Sarkar, Hypercube connected rings: a scalable and fault-tolerant logical topology for optical networks Computer Communications. ,vol. 24, pp. 1060- 1079 ,(2001) , 10.1016/S0140-3664(00)00336-4
Thara Angskun, George Bosilca, Jack Dongarra, Binomial graph: a scalable and fault-tolerant logical network topology international symposium on parallel and distributed processing and applications. pp. 471- 482 ,(2007) , 10.1007/978-3-540-74742-0_43