Efficiently Extracting Operational Profiles from Execution Logs Using Suffix Arrays

作者: Meiyappan Nagappan , Kesheng Wu , Mladen A. Vouk

DOI: 10.1109/ISSRE.2009.23

关键词: SuffixEngineering toolContext (language use)Computer scienceComputational complexity theoryCloud computingSoftwareConstruction industrySoftware qualityData mining

摘要: An important software reliability engineering tool is operational profiles. In this paper we propose a cost effective automated approach for creating second generation profiles using execution logs of product. Our algorithm parses the into sequences events and produces an ordered list all possible subsequences by constructing suffix-array events. The difficulty in that amount data needs to be analyzed often extremely large (more than million records per day many applications). very efficient. We show our requires O(N) space time discover patterns N discuss practical implementation context from cloud computing system.

参考文章(19)
Wei Xu, Ling Huang, Armando Fox, David A Patterson, Michael I Jordan, None, Mining console logs for large-scale system problem detection usenix workshop on tackling computer systems problems with machine learning techniques. pp. 4- 4 ,(2008)
Stephen E. Hansen, E. Todd Atkins, Automated System Monitoring and Notification With Swatch usenix large installation systems administration conference. pp. 145- 152 ,(1993)
G. Pokam, F. Bodin, An offline approach for whole-program paths analysis using suffix arrays ieee international conference on high performance computing data and analytics. pp. 363- 378 ,(2004) , 10.1007/11532378_26
Toru Kasai, Gunho Lee, Hiroki Arimura, Setsuo Arikawa, Kunsoo Park, Linear-Time Longest-Common-Prefix Computation in Suffix Arrays and Its Applications combinatorial pattern matching. pp. 181- 192 ,(2001) , 10.1007/3-540-48194-X_17
John D. Musa, The operational profile international symposium on software reliability engineering. pp. 167- 216 ,(1996) , 10.1007/978-3-662-03274-9_18
Abdelwahab Hamou-Lhadj, Timothy C. Lethbridge, A survey of trace exploration tools and techniques conference of the centre for advanced studies on collaborative research. pp. 42- 55 ,(2004)
R. Vaarandi, A data clustering algorithm for mining patterns from event logs ip operations and management. pp. 119- 126 ,(2003) , 10.1109/IPOM.2003.1251233
Udi Manber, Gene Myers, Suffix arrays: a new method for on-line string searches symposium on discrete algorithms. pp. 319- 327 ,(1990) , 10.5555/320176.320218
Meiyappan Nagappan, Mladen A. Vouk, Kesheng Wu, Alex Sim, Arie Shoshani, Efficient Operational Profiling of Systems Using Suffix Arrays on Execution Logs international symposium on software reliability engineering. pp. 313- 314 ,(2008) , 10.1109/ISSRE.2008.45
Karl Aberer, Christian Lovis, Mirko Steinle, Sarunas Girdzijauskas, Mapping moving landscapes by mining mountains of logs: novel techniques for dependency model generation very large data bases. pp. 1093- 1102 ,(2006) , 10.5555/1182635.1164221