MapReduce-guided scalable compressed dictionary construction for evolving repetitive sequence streams

作者: Pallabi Parveen , Pratik Desai , Bhavani Thuraisingham , Latifur Khan

DOI: 10.4108/ICST.COLLABORATECOM.2013.254135

关键词:

摘要: Users' repetitive daily or weekly activities may constitute user profiles. For example, a user's frequent command sequences represent normative pattern of that user. To find patterns over dynamic data streams unbounded length is challenging. this, an unsupervised learning approach proposed in our prior work by exploiting compressed/quantized dictionary to model common behavior sequences. This suffers scalability issues. Hence, this paper, we propose and implement MapReduce-based framework construct quantized dictionary. We show effectiveness distributed parallel solution on benchmark dataset.

参考文章(18)
Hans W. Guesgen, Stephen Marsland, Sook-Ling Chua, Unsupervised learning of patterns in data streams using compression and edit distance international joint conference on artificial intelligence. pp. 1231- 1236 ,(2011) , 10.5591/978-1-57735-516-8/IJCAI11-209
Saul Greenberg, USING UNIX: COLLECTED TRACES OF 168 USERS University of Calgary. ,(1988) , 10.11575/PRISM/30806
V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals Soviet physics. Doklady. ,vol. 10, pp. 707- 710 ,(1966)
Ahsanul Haque, Brandon Parker, Latifur Khan, Labeling Instances in Evolving Data Streams with MapReduce international congress on big data. pp. 387- 394 ,(2013) , 10.1109/BIGDATA.CONGRESS.2013.58
Tahseen M Al-Khateeb, Mohammad M Masud, Latifur Khan, Bhavani Thuraisingham, None, Cloud Guided Stream Classification Using Class-Based Ensemble international conference on cloud computing. pp. 694- 701 ,(2012) , 10.1109/CLOUD.2012.127
Yehuda Vardi, Martin Theusan, Alan F. Karr, Wen-Hua Ju, William DuMouchel, Matthias Schonlau, Computer Intrusion: Detecting Masquerades Statistical Science. ,vol. 16, pp. 58- 74 ,(2001) , 10.1214/SS/998929476
Pallabi Parveen, Nate McDaniel, Varun S. Hariharan, Bhavani Thuraisingham, Latifur Khan, Unsupervised Ensemble Based Learning for Insider Threat Detection privacy security risk and trust. pp. 718- 727 ,(2012) , 10.1109/SOCIALCOM-PASSAT.2012.106
Pallabi Parveen, Bhavani Thuraisingham, Unsupervised incremental sequence learning for insider threat detection intelligence and security informatics. pp. 141- 143 ,(2012) , 10.1109/ISI.2012.6284271
Welch, A Technique for High-Performance Data Compression IEEE Computer. ,vol. 17, pp. 8- 19 ,(1984) , 10.1109/MC.1984.1659158