Comprehensive log compression with frequent patterns

作者: Kimmo Hätönen , Jean François Boulicaut , Mika Klemettinen , Markus Miettinen , Cyrille Masson

DOI: 10.1007/978-3-540-45228-7_36

关键词:

摘要: In this paper we present a comprehensive log compression (CLC) method that uses frequent patterns and their condensed representations to identify repetitive information from large files generated by communications networks. We also show how the identified can be used separate filter out frequently occurring events hide other, unique or only few times events. The identification done without any prior knowledge about domain For example, no pre-defined value combinations are needed. This separation makes it easier for human observer perceive analyse amounts of data. applicability CLC is demonstrated with real-world examples data communication

参考文章(18)
J. Pei, Jiawei Han, Runying Mao, CLOSET : An Efficient Algorithm for Mining Frequent Closed Itemsets international conference on management of data. pp. 21- 30 ,(2000)
Ronald J. Brachman, Tej Anand, The process of knowledge discovery in databases: a first sketch knowledge discovery and data mining. pp. 1- 11 ,(1994)
Jun Sese, Shinichi Morishita, Answering the Most Correlated N Association Rules Efficiently european conference on principles of data mining and knowledge discovery. pp. 410- 422 ,(2002) , 10.1007/3-540-45681-3_34
Tobias Scheffer, Finding association rules that trade support optimally against confidence european conference on principles of data mining and knowledge discovery. ,vol. 9, pp. 424- 435 ,(2001) , 10.1007/3-540-44794-6_35
Gregory Piatetsky-Shapiro, Usama M. Fayyad, Padhraic Smyth, From data mining to knowledge discovery: an overview knowledge discovery and data mining. pp. 1- 34 ,(1996)
Heikki Mannila, A. Inkeri Verkamo, Ramakrishnan Srikant, Hannu Toivonen, Rakesh Agrawal, Fast discovery of association rules knowledge discovery and data mining. pp. 307- 328 ,(1996)
Raymond Kosala, Hendrik Blockeel, Web mining research: a survey Sigkdd Explorations. ,vol. 2, pp. 1- 15 ,(2000) , 10.1145/360402.360406
Usama Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, The KDD process for extracting useful knowledge from volumes of data Communications of the ACM. ,vol. 39, pp. 27- 34 ,(1996) , 10.1145/240455.240464
Artur Bykowski, Christophe Rigotti, A condensed representation to find frequent patterns symposium on principles of database systems. pp. 267- 273 ,(2001) , 10.1145/375551.375604
Jaideep Srivastava, Robert Cooley, Mukund Deshpande, Pang-Ning Tan, Web usage mining ACM SIGKDD Explorations Newsletter. ,vol. 1, pp. 12- 23 ,(2000) , 10.1145/846183.846188