Hora: Architecture-aware online failure prediction

作者: Teerat Pitakrat , Dušan Okanović , André van Hoorn , Lars Grunske

DOI: 10.1016/J.JSS.2017.02.041

关键词:

摘要: Abstract Complex software systems experience failures at runtime even though a lot of effort is put into the development and operation. Reactive approaches detect these after they have occurred already caused serious consequences. In order to execute proactive actions, goal online failure prediction in advance by monitoring quality service or system events. Current look individual components as monolith without considering architecture system. They disregard fact that one component can propagate through cause problems other components. this paper, we propose hierarchical approach, called Hora , which combines predictors with architectural knowledge. The propagation modeled using Bayesian networks incorporate both results dependencies extracted from models. Our approach evaluated Netflix’s server-side distributed RSS reader application predict three representative types faults: memory leak, overload, sudden node crash. We compare monolithic show our improve area under ROC curve 9.9%.

参考文章(66)
Ray Walker, Examining load average Linux Journal. ,vol. 2006, pp. 5- ,(2006)
Vittorio Cortellessa, Antinisca Di Marco, Paola Inverardi, Model-Based Software Performance Analysis ,(2011)
Len Bass, Ingo Weber, Liming Zhu, Devops: A Software Architect's Perspective ,(2015)
Robert H. Shumway, David S. Stoffer, Time series analysis and its applications ,(2000)
John D. Musa, Geoffrey A. Wilson, A. Frank Ackerman, William W. Everett, Software Reliability Engineering ,(1998)
Muhammad Ali Babar, L. Zhu, Ian Gorton, Comparison of scenario-based software architecture evaluation methods asia-pacific software engineering conference. pp. 600- 607 ,(2004) , 10.1109/APSEC.2004.38
Mike Hibler, Robert Ricci, Leigh Stoller, Jonathon Duerig, Shashi Guruprasad, Tim Stack, Kirk Webb, Jay Lepreau, None, Large-scale virtualization in the Emulab network testbed usenix annual technical conference. pp. 113- 128 ,(2008)