An adaptive virtual machine replication algorithm for highly-available services

作者: Adrian Colesa , Bica Mihai

DOI:

关键词:

摘要: This paper presents an adaptive algorithm for the replication process of a primary virtual machine (VM) hosting service that must be provided high-availability. Running in VM and replicating entire is general strategy, totally transparent itself its clients. The takes place phases, which are run asynchronous efficiency reasons. adapts to running context, consisting behavior available bandwidth between backup nodes. length each phase determined dynamically, order reduce as much possible latencies experienced by clients service, especially case degraded connectivity We implemented our extension Xen hypervisor's migration operation. It proved better than non-adaptive variants.

参考文章(16)
Mike Feeley, Brendan Cully, Andrew Warfield, Dutch Meyer, Geoffrey Lefebvre, Norm Hutchinson, Remus: high availability via asynchronous virtual machine replication networked systems design and implementation. pp. 161- 174 ,(2008)
Rachid Guerraoui, André Schiper, Fault-Tolerance by Replication in Distributed Systems international conference on reliable software technologies. pp. 38- 57 ,(1996) , 10.1007/BFB0013477
Zhenyu Guo, Zhilei Xu, Xuezheng Liu, Xi Wang, Ming Wu, Zheng Zhang, Jian Tang, M. Frans Kaashoek, R2: an application-level kernel for record and replay operating systems design and implementation. pp. 193- 208 ,(2008) , 10.5555/1855741.1855755
Christian Limpach, Eric Jul, Andrew Warfield, Steven Hand, Ian Pratt, Christopher Clark, Jacob Gorm Hansen, Keir Fraser, Live migration of virtual machines networked systems design and implementation. pp. 273- 286 ,(2005) , 10.5555/1251203.1251223
Flaviu Cristian, Bob Dancey, Jon Dehn, Fault-tolerance in the advanced automation system acm sigops european workshop. pp. 6- 17 ,(1990) , 10.1145/504136.504156
Yasushi Saito, Jockey: a user-space library for record-replay debugging international symposium on open collaboration. pp. 69- 76 ,(2005) , 10.1145/1085130.1085139
A Colesa, I Stan, I Ignat, Transparent Fault-Tolerance Based on Asynchronous Virtual Machine Replication symbolic and numeric algorithms for scientific computing. pp. 442- 448 ,(2010) , 10.1109/SYNASC.2010.58
Manish Marwah, Shivakant Mishra, Christof Fetzer, Fault-tolerant and scalable TCP splice and web server architecture 2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06). pp. 301- 310 ,(2006) , 10.1109/SRDS.2006.21
Kenneth P. Birman, Thomas A. Joseph, Reliable communication in the presence of failures ACM Transactions on Computer Systems. ,vol. 5, pp. 47- 76 ,(1987) , 10.1145/7351.7478