Efficient high utility itemset mining using buffered utility-lists

Authors: Quang-Huy Duong, Philippe Fournier-Viger, Heri Ramampiaro, Kjetil Nørvåg, Thu-Lan Dam

DOI: 10.1007/S10489-017-1057-2

Keywords:

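Abstract: Discovering high utility itemsets in transaction databases is a key task for studying the behavior of customers. It consists of finding groups of items bought together that yield a high profit. Several algorithms have been proposed to mine high utility itemsets using various approaches and more or less complex data structures. Among existing algorithms, one-phase algorithms employing the utility-list structure have been shown to be the most efficient. In recent years, the simplicity of the utility-list structure has led to the development of numerous utility-list based algorithms for various tasks related to utility mining. However, a major limitation of utility-list based algorithms is that creating and maintaining utility-lists is time consuming and can consume a huge amount of memory. The reasons are that numerous utility-lists are built and that the utility-list intersection/join operation used to construct a utility-list is costly. This paper addresses this issue by proposing an improved utility-list structure called the utility-list buffer to reduce memory consumption and speed up the join operation. This structure is integrated into a novel algorithm named ULB-Miner (Utility-List Buffer for high utility itemset Miner), which introduces several new ideas to discover high utility itemsets more efficiently. ULB-Miner uses the designed utility-list buffer structure to efficiently store and retrieve utility-lists, and to reuse memory during the mining process. Moreover, the paper also introduces a linear time method for constructing utility-list segments in a utility-list buffer. An extensive experimental study on various datasets shows that the proposed algorithm relying on the novel utility-list buffer structure is highly efficient in terms of both execution time and memory consumption. ULB-Miner is up to 10 times faster than the FHM and HUI-Miner algorithms and consumes up to 6 times less memory. Moreover, it performs well on both dense and sparse datasets.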
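
The buffer idea described in the abstract can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything below is an assumption rather than the authors' ULB-Miner code: the entry layout (transaction id, utility of the itemset, remaining utility) follows the usual utility-list convention, and the class name `UtilityListBuffer`, its methods, and the sample values are hypothetical. The sketch stores all utility-lists as segments of one shared buffer, joins two single-item segments with a two-pointer merge that is linear in their lengths, and releases the tail of the buffer so the space can be reused.

```python
from dataclasses import dataclass, field


@dataclass
class UtilityListBuffer:
    """Hypothetical flat buffer: every utility-list is a (start, end) segment."""
    tids: list = field(default_factory=list)    # transaction ids
    iutils: list = field(default_factory=list)  # utility of the itemset in that transaction
    rutils: list = field(default_factory=list)  # remaining utility in that transaction

    def add_segment(self, entries):
        """Append (tid, iutil, rutil) entries sorted by tid; return the segment."""
        start = len(self.tids)
        for tid, iutil, rutil in entries:
            self.tids.append(tid)
            self.iutils.append(iutil)
            self.rutils.append(rutil)
        return (start, len(self.tids))

    def join(self, seg_x, seg_y):
        """Two-pointer merge of two single-item segments, linear in their lengths.

        Assumption: the prefix-utility subtraction needed when joining
        longer itemsets is omitted to keep the sketch short.
        """
        start = len(self.tids)
        i, end_x = seg_x
        j, end_y = seg_y
        while i < end_x and j < end_y:
            if self.tids[i] < self.tids[j]:
                i += 1
            elif self.tids[i] > self.tids[j]:
                j += 1
            else:
                # Same transaction: the joined entry goes at the end of the buffer.
                self.tids.append(self.tids[i])
                self.iutils.append(self.iutils[i] + self.iutils[j])
                self.rutils.append(self.rutils[j])
                i += 1
                j += 1
        return (start, len(self.tids))

    def release_from(self, start):
        """Drop every entry from index `start` on so the space can be reused."""
        del self.tids[start:]
        del self.iutils[start:]
        del self.rutils[start:]

    def total_utility(self, seg):
        """Sum of the itemset utilities stored in a segment."""
        return sum(self.iutils[seg[0]:seg[1]])


# Hypothetical sample data: utility-lists of items a and b.
buf = UtilityListBuffer()
seg_a = buf.add_segment([(1, 5, 3), (2, 10, 0), (4, 5, 2)])
seg_b = buf.add_segment([(1, 2, 1), (3, 4, 0), (4, 6, 0)])
seg_ab = buf.join(seg_a, seg_b)
print(seg_ab, buf.total_utility(seg_ab))
```

Keeping every list in one buffer means a joined list costs no new object allocation, and calling `release_from(seg_ab[0])` after a search branch is finished hands the same slots to the next join.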

References (37)
Ramakrishnan Srikant, Rakesh Agrawal. Fast Algorithms for Mining Association Rules in Large Databases. Very Large Data Bases, pp. 487-499 (1994).
Philippe Fournier-Viger, Souleymane Zida. FOSHU: faster on-shelf high utility itemset mining -- with or without negative unit profit. ACM Symposium on Applied Computing, pp. 857-864 (2015). DOI: 10.1145/2695664.2695823
Wei Song, Yu Liu, Jinhong Li. Mining high utility itemsets by dynamically pruning the tree structure. Applied Intelligence, vol. 40, pp. 29-43 (2014). DOI: 10.1007/S10489-013-0443-7
Unil Yun, Heungmo Ryang, Keun Ho Ryu. High utility itemset mining with techniques for reducing overestimated utilities and pruning candidates. Expert Systems With Applications, vol. 41, pp. 3861-3878 (2014). DOI: 10.1016/J.ESWA.2013.11.038
Vincent S. Tseng, Bai-En Shie, Cheng-Wei Wu, Philip S. Yu. Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases. IEEE Transactions on Knowledge and Data Engineering, vol. 25, pp. 1772-1786 (2013). DOI: 10.1109/TKDE.2012.59
Guo-Cheng Lan, Tzung-Pei Hong, Vincent S. Tseng. An efficient projection-based indexing approach for mining high utility itemsets. Knowledge and Information Systems, vol. 38, pp. 85-107 (2014). DOI: 10.1007/S10115-012-0492-Y
Yu-Cheng Liu, Chun-Pei Cheng, Vincent S. Tseng. Mining differential top-k co-expression patterns from time course comparative gene expression datasets. BMC Bioinformatics, vol. 14, pp. 230-230 (2013). DOI: 10.1186/1471-2105-14-230
M. Thilagu, R. Nadarajan. Efficiently Mining of Effective Web Traversal Patterns with Average Utility. Procedia Technology, vol. 6, pp. 444-451 (2012). DOI: 10.1016/J.PROTCY.2012.10.053
Wei Song, Yu Liu, Jinhong Li. BAHUI: Fast and Memory Efficient Mining of High Utility Itemsets Based on Bitmap. International Journal of Data Warehousing and Mining, vol. 10, pp. 1-15 (2014). DOI: 10.4018/IJDWM.2014010101
Mohammed J. Zaki, Karam Gouda. Fast vertical mining using diffsets. Knowledge Discovery and Data Mining, pp. 326-335 (2003). DOI: 10.1145/956750.956788