FATE and DESTINI: a framework for cloud recovery testing

Authors: Remzi H. Arpaci-Dusseau, Haryadi S. Gunawi, Andrea C. Arpaci-Dusseau, Dhruba Borthakur, Joseph M. Hellerstein

DOI: 10.5555/1972457.1972482

Abstract: As the cloud era begins and failures become commonplace, failure recovery becomes a critical factor in the availability, reliability, and performance of cloud services. Unfortunately, recovery problems still take place, causing downtimes, data loss, and many other problems. We propose a new testing framework for cloud recovery: FATE (Failure Testing Service) and DESTINI (Declarative Testing Specifications). With FATE, recovery is systematically tested in the face of multiple failures. With DESTINI, correct recovery is specified clearly, concisely, and precisely. We have integrated our framework to several cloud systems (e.g., HDFS [33]), explored over 40,000 failure scenarios, wrote 74 specifications, found 16 new bugs, and reproduced 51 old bugs.

References (28)
Mahadev Konar, Benjamin Reed, Flavio P. Junqueira, Patrick Hunt. ZooKeeper: wait-free coordination for internet-scale systems. USENIX Annual Technical Conference, pp. 11-11 (2010).
Zhenyu Guo, Feibo Chen, Xuezheng Liu, Xi Wang, Ming Wu, Zheng Zhang, Xiaochen Lian, Jian Tang, M. Frans Kaashoek. D3S: debugging deployed distributed systems. Networked Systems Design and Implementation, pp. 423-437 (2008).
James Hamilton. On designing and deploying internet-scale services. USENIX Large Installation Systems Administration Conference, pp. 18- (2007).
Xuezheng Liu, Zheng Zhang, Wei Lin, Aimin Pan. WiDS checker: combating bugs in distributed systems. Networked Systems Design and Implementation, pp. 19-19 (2007).
Amin Vahdat, Ranjit Jhala, Charles Killian, James W. Anderson. Life, death, and the critical transition: finding liveness bugs in systems code. Networked Systems Design and Implementation, pp. 18-18 (2007).
Sudhakar Govindavajhala, Xinming Ou, Andrew W. Appel. MulVAL: a logic-based network security analyzer. USENIX Security Symposium, pp. 8-8 (2005).
Junfeng Yang, Tisheng Chen, Mao Yang, Fan Long, Zhilei Xu, Haoxiang Lin, Lintao Zhang, Lidong Zhou, Xuezheng Liu, Ming Wu. MODIST: transparent model checking of unmodified distributed systems. Networked Systems Design and Implementation, pp. 213-228 (2009).
Remzi H. Arpaci-Dusseau, Haryadi S. Gunawi, Andrea C. Arpaci-Dusseau, Abhishek Rajimwale. SQCK: a declarative file system checker. Operating Systems Design and Implementation, pp. 131-146 (2008). DOI: 10.5555/1855741.1855751
Viktor Kuncak, Dejan Kostic, Nikola Knezevic, Maysam Yabandeh. CrystalBall: predicting and preventing inconsistencies in deployed distributed systems. Networked Systems Design and Implementation, pp. 229-244 (2009).
Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber. Bigtable. ACM Transactions on Computer Systems, vol. 26, pp. 1-26 (2008). DOI: 10.1145/1365815.1365816