Friday: global comprehension for distributed replay

Authors: Dennis Geels, Gautam Altekar, Petros Maniatis, Timothy Roscoe, Ion Stoica

Abstract: Debugging and profiling large-scale distributed applications is a daunting task. We present Friday, a system for debugging distributed applications that combines deterministic replay of components with the power of symbolic, low-level debugging and a simple language for expressing higher-level distributed conditions and actions. Friday allows the programmer to understand the collective state and dynamics of a distributed collection of coordinated application components. To evaluate Friday, we consider several distributed problems, including routing consistency in overlay networks and temporal state abnormalities caused by route flaps. We show via micro-benchmarks and larger-scale application measurement that Friday can be used interactively to debug large distributed applications under replay on common hardware.
