Detecting and monitoring abrupt emergences and submergences of episodes over data streams

作者: Min Gan , Honghua Dai

DOI: 10.1016/J.IS.2012.05.009

关键词: Data miningData stream miningReal-time computingComputer scienceStream processingSynthetic data

摘要: Existing studies on episode mining mainly concentrate the discovery of (global) frequent episodes in sequences. However, are not suited for data streams because they do capture dynamic nature streams. This paper focuses detecting changes frequencies over time-evolving We propose an efficient method online detection abrupt emerging and submerging Experimental results synthetic show that proposed can effectively detect defined patterns meet strict requirements stream processing, such as one-pass, real-time update return results, plus limited time space consumption. real demonstrate detected by our natural meaningful. The has wide applications monitoring analysis discovered indicate emergences/disappearances noteworthy events/phenomena hidden

参考文章(24)
Heikki Mannila, A. Inkeri Verkamo, Hannu Toivonen, Discovering Frequent Episodes in Sequences. knowledge discovery and data mining. pp. 210- 215 ,(1995)
Heikki Mannila, Hannu Toivonen, Discovering generalized episodes using minimal occurrences knowledge discovery and data mining. pp. 146- 151 ,(1996)
Nicolas Méger, Christophe Rigotti, Constraint-based mining of episode rules and optimal window sizes european conference on principles of data mining and knowledge discovery. pp. 313- 324 ,(2004) , 10.1007/978-3-540-30116-5_30
Sherri K. Harms, Jitender S. Deogun, Sequential Association Rule Mining with Time Lags intelligent information systems. ,vol. 22, pp. 7- 22 ,(2004) , 10.1023/A:1025824629047
Alfredo Cuzzocrea, CAMS: OLAPing Multidimensional Data Streams Efficiently data warehousing and knowledge discovery. ,vol. 5691, pp. 48- 62 ,(2009) , 10.1007/978-3-642-03730-6_5
Zhiping Zeng, Jianyong Wang, Lizhu Zhou, Efficient mining of minimal distinguishing subgraph patterns from graph databases knowledge discovery and data mining. pp. 1062- 1068 ,(2008) , 10.1007/978-3-540-68125-0_114
Gemma Casas-Garriga, Discovering Unbounded Episodes in Sequential Data european conference on principles of data mining and knowledge discovery. pp. 83- 94 ,(2003) , 10.1007/978-3-540-39804-2_10
Mohammed J. Zaki, SPADE: An Efficient Algorithm for Mining Frequent Sequences Machine Learning. ,vol. 42, pp. 31- 60 ,(2001) , 10.1023/A:1007652502315
Alfredo Cuzzocrea, Filippo Furfaro, Giuseppe M. Mazzeo, Domenico Saccà, A Grid Framework for Approximate Aggregate Query Answering on Summarized Sensor Network Readings On the Move to Meaningful Internet Systems 2004: OTM 2004 Workshops. pp. 144- 153 ,(2004) , 10.1007/978-3-540-30470-8_32