Self-healing multitier architectures using cascading rescue points

作者: Angeliki Zavou , Georgios Portokalidis , Angelos D. Keromytis

DOI: 10.1145/2420950.2421005

关键词:

摘要: Software bugs and vulnerabilities cause serious problems to both home users the Internet infrastructure, limiting availability of services, causing loss data, reducing system integrity. self-healing using rescue points (RPs) is a known mechanism for recovering from unforeseen errors. However, applying it on multitier architectures can be problematic because certain actions, like transmitting data over network, cannot undone. We propose cascading (CRPs) address state inconsistency issues that arise when traditional RPs recover errors in interconnected applications. With CRPs, an application executing within RP transmits remote peer notified also perform checkpoint, so communicating entities checkpoint coordinated, but loosely coupled way. Notifications are sent successfully complete execution, recovery initiated, appropriate action performed by parties. developed tool implements CRPs dynamically instrumenting binaries transparently injecting notifications already established TCP channels between tested our with various applications, including MySQL Apache servers, show allows them errors, while incurring moderate overhead 4.54% 71.56%.

参考文章(31)
Angelos D. Keromytis, Characterizing Software Self-healing Systems mathematical methods models and architectures for network security systems. pp. 22- 33 ,(2007) , 10.1007/978-3-540-73986-9_2
George Candea, Armando Fox, None, Crash-only software hot topics in operating systems. pp. 12- 12 ,(2003)
George W. Dunlap, Peter M. Chen, Samuel T. King, Debugging operating systems with time-traveling virtual machines usenix annual technical conference. pp. 1- 1 ,(2005)
Michael E. Locasto, Angelos D. Keromytis, Stelios Sidiroglou, Stephen W. Boyd, Building a reactive immune system for software services usenix annual technical conference. pp. 11- 11 ,(2005) , 10.7916/D86D6562
Martin Rinard, Cristian Cadar, William S. Beebee, Daniel M. Roy, Tudor Leu, Daniel Dumitran, Enhancing server availability and security through failure-oblivious computing operating systems design and implementation. pp. 21- 21 ,(2004)
Georgios Portokalidis, Angelos D. Keromytis, REASSURE: A Self-contained Mechanism for Healing Software Using Rescue Points Advances in Information and Computer Security. pp. 16- 32 ,(2011) , 10.1007/978-3-642-25141-2_2
James S. Plank, Kai Li, Micah Beck, Gerry Kingsley, Libckpt: transparent checkpointing under Unix usenix annual technical conference. pp. 18- 18 ,(1995)
Peter Saint-Andre, Out of Band Data XMPP Standards Foundation. ,(2006)
Cristian Cadar, Daniel Dunbar, Dawson Engler, KLEE: unassisted and automatic generation of high-coverage tests for complex systems programs operating systems design and implementation. pp. 209- 224 ,(2008) , 10.5555/1855741.1855756
Hiralal Agrawal, Richard A. Demillo, Eugene H. Spafford, Debugging with dynamic slicing and backtracking Software - Practice and Experience. ,vol. 23, pp. 589- 616 ,(1993) , 10.1002/SPE.4380230603