Differential analysis of Operating System indicators for anomaly detection in dependable systems: An experimental study

作者: Andrea Bondavalli , Andrea Ceccarelli , Francesco Brancati , Diego Santoro , Michele Vadursi

DOI: 10.1016/J.MEASUREMENT.2015.11.010

关键词: Reliability (computer networking)Anomaly detectionReal-time computingFault toleranceOperating systemFault detection and isolationSystem monitoringDependabilityAir traffic managementEngineeringIdentification (information)

摘要: Abstract Dependable complex systems often operate under variable and non-stationary conditions, which requires efficient extensive monitoring error detection solutions. Among the many, paper focuses on anomaly techniques, monitor evolution of some specific indicators through time to identify anomalies, i.e. deviations from expected operational behavior. The timely identification anomalies in dependable, fault tolerant allows detect errors services react appropriately. In this paper, we investigate possibility using random walk model belonging Operating Systems, specifically our study Linux Red Hat EL5. approach is based experimental evaluation a large set heterogeneous indicators, are acquired different operating both terms workload faultload, an air traffic management target system. statistical analysis best-fitting aiming minimize integral distance between empirical data distribution reference distributions. outcomes show that idea adopting for development critical operates at System level promising. Moreover, standard distributions such as Laplace Cauchy, rather than Normal, should be used setting up thresholds monitor. Further studies involve new application, layer (an Application Server) will allow verifying generalization other systems, monitored layers indicators.

参考文章(31)
Larry Peterson, Sapan Bhatia, Abhishek Kumar, Marc E. Fiuczynski, Lightweight, high-resolution monitoring for troubleshooting production systems operating systems design and implementation. pp. 103- 116 ,(2008) , 10.5555/1855741.1855749
Andrea Bondavalli, Andrea Ceccarelli, Florjan Gogaj, Andrea Seminatore, Michele Vadursi, Experimental assessment of low-cost GPS-based localization in railway worksite-like scenarios Measurement. ,vol. 46, pp. 456- 466 ,(2013) , 10.1016/J.MEASUREMENT.2012.08.001
Andrea Bondavalli, Francesco Brancati, Andrea Ceccarelli, Diego Santoro, Michele Vadursi, Experimental analysis of the first order time difference of indicators used in the monitoring of complex systems 2013 IEEE International Workshop on Measurements & Networking (M&N). pp. 138- 142 ,(2013) , 10.1109/IWMN.2013.6663792
Qiang Guan, Song Fu, Adaptive Anomaly Identification by Exploring Metric Subspace in Cloud Computing Infrastructures symposium on reliable distributed systems. pp. 205- 214 ,(2013) , 10.1109/SRDS.2013.29
Edward Chuah, Arshad Jhumka, Sai Narasimhamurthy, John Hammond, James C. Browne, Bill Barth, Linking Resource Usage Anomalies with System Failures from Cluster Log Data symposium on reliable distributed systems. pp. 111- 120 ,(2013) , 10.1109/SRDS.2013.20
Antonio Bovenzi, Stefano Russo, Francesco Brancati, Andrea Bondavalli, Towards identifying OS-level anomalies to detect application software failures 2011 IEEE International Workshop on Measurements and Networking Proceedings (M&N). pp. 71- 76 ,(2011) , 10.1109/IWMN.2011.6088494
Andrea Bondavalli, Francesco Brancati, Alessandra Flammini, Stefano Rinaldi, Master Failure Detection Protocol in Internal Synchronization Environment IEEE Transactions on Instrumentation and Measurement. ,vol. 62, pp. 4- 12 ,(2013) , 10.1109/TIM.2012.2209916