EFFICIENTLY MINING HIGH AVERAGE-UTILITY ITEMSETS WITH AN IMPROVED UPPER-BOUND STRATEGY

作者: GUO-CHENG LAN , TZUNG-PEI HONG , VINCENT S. TSENG

DOI: 10.1142/S0219622012500307

关键词: Field (computer science)Process (computing)Measure (mathematics)Upper and lower boundsMathematicsValue (computer science)Utility miningPrefixData mining

摘要: Utility mining has recently been discussed in the field of data mining. A utility itemset considers both profits and quantities items transactions, thus its value increases with increasing length. To reveal a better effect, an average-utility measure, which is total divided by length, proposed. However, existing approaches use traditional upper-bound model to find high itemsets, generate large number unpromising candidates process. The present study proposes improved approach that uses prefix concept create tighter upper bounds values for reducing itemsets Results from experiments on two real databases show proposed algorithm outperforms other algorithms under various parameter settings.

参考文章(23)
Vincent S. Tseng, Chun-Jung Chu, Tyne Liang, MINING TEMPORAL RARE UTILITY ITEMSETS IN LARGE DATABASES USING RELATIVE UTILITY THRESHOLDS International Journal of Innovative Computing Information and Control. ,vol. 4, pp. 2775- 2792 ,(2008)
Dao-I Lin, Zvi M. Kedem, Pincer-Search: A New Algorithm for Discovering the Maximum Frequent Set extending database technology. pp. 105- 119 ,(1998) , 10.1007/BFB0100980
Chun-Wei Lin, Guo-Cheng Lan, Tzung-Pei Hong, An incremental mining algorithm for high utility itemsets Expert Systems With Applications. ,vol. 39, pp. 7173- 7180 ,(2012) , 10.1016/J.ESWA.2012.01.072
Hong Yao, Howard J. Hamilton, Mining itemset utilities from transaction databases data and knowledge engineering. ,vol. 59, pp. 603- 626 ,(2006) , 10.1016/J.DATAK.2005.10.004
Guo-Cheng Lan, Tzung-Pei Hong, Vincent S. Tseng, Discovery of high utility itemsets from on-shelf time periods of products Expert Systems With Applications. ,vol. 38, pp. 5851- 5857 ,(2011) , 10.1016/J.ESWA.2010.11.040
YI PENG, GANG KOU, YONG SHI, ZHENGXIN CHEN, A descriptive framework for the field of data mining and knowledge discovery International Journal of Information Technology and Decision Making. ,vol. 07, pp. 639- 682 ,(2007) , 10.1142/S0219622008003204
Hua-Fu Li, Hsin-Yun Huang, Suh-Yin Lee, Fast and memory efficient mining of high-utility itemsets from data streams: with and without negative item profits Knowledge and Information Systems. ,vol. 28, pp. 495- 522 ,(2011) , 10.1007/S10115-010-0330-Z
Tzung-Pei Hong, Cho-Han Lee, Shyue-Liang Wang, None, Effective utility mining with the measure of average utility Expert Systems With Applications. ,vol. 38, pp. 8259- 8265 ,(2011) , 10.1016/J.ESWA.2011.01.006
YING LIU, JIANWEI LI, WEI-KENG LIAO, ALOK CHOUDHARY, YONG SHI, High utility itemsets mining International Journal of Information Technology and Decision Making. ,vol. 9, pp. 905- 934 ,(2010) , 10.1142/S0219622010004159
Jiawei Han, Jian Pei, Yiwen Yin, Mining frequent patterns without candidate generation international conference on management of data. ,vol. 29, pp. 1- 12 ,(2000) , 10.1145/335191.335372