Answering the Most Correlated N Association Rules Efficiently

作者: Jun Sese , Shinichi Morishita

DOI: 10.1007/3-540-45681-3_34

关键词: Computer scienceAssociation rule learningSimple (abstract algebra)Tree (data structure)Property (programming)Pruning (decision trees)Data miningMetric (mathematics)Heuristics

摘要: Many algorithms have been proposed for computing association rules using the support-confidence framework. One drawback of this framework is its weakness in expressing notion correlation. We propose an efficient algorithm mining that uses statistical metrics to determine The simple application conventional techniques developed not possible, since functions correlation do meet anti-monotonicity property crucial traditional methods. In paper, we heuristics vertical decomposition a database, pruning unproductive itemsets, and traversing set-enumeration tree itemsets tailored calculation N most significant rules, where can be specified by user. experimentally compared combination these three with previous approach. Our tests confirmed comutational performance improves several orders magnitude.

参考文章(19)
Srinivasan Parthasarathy, Mitsunori Ogihara, Mohammed J Zaki, Wei Li, New algorithms for fast discovery of association rules knowledge discovery and data mining. pp. 283- 286 ,(1997)
Ramakrishnan Srikant, Rakesh Agrawal, Fast algorithms for mining association rules very large data bases. pp. 580- 592 ,(1998)
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Ron Rymon, Search through systematic set enumeration principles of knowledge representation and reasoning. pp. 539- 550 ,(1992)
Charu C. Aggarwal, Philip S. Yu, A new framework for itemset generation symposium on principles of database systems. pp. 18- 24 ,(1998) , 10.1145/275487.275490
Pradeep Shenoy, Jayant R. Haritsa, S. Sudarshan, Gaurav Bhalotia, Mayank Bawa, Devavrat Shah, Turbo-charging vertical mining of large databases international conference on management of data. ,vol. 29, pp. 22- 33 ,(2000) , 10.1145/335191.335376
Shinichi Morishita, Jun Sese, Transversing itemset lattices with statistical metric pruning symposium on principles of database systems. pp. 226- 236 ,(2000) , 10.1145/335168.335226
Jong Soo Park, Ming-Syan Chen, Philip S. Yu, An effective hash-based algorithm for mining association rules international conference on management of data. ,vol. 24, pp. 175- 186 ,(1995) , 10.1145/223784.223813
Sergey Brin, Rajeev Motwani, Jeffrey D. Ullman, Shalom Tsur, Dynamic itemset counting and implication rules for market basket data international conference on management of data. ,vol. 26, pp. 255- 264 ,(1997) , 10.1145/253260.253325
Bing Liu, Wynne Hsu, Yiming Ma, Pruning and summarizing the discovered associations knowledge discovery and data mining. pp. 125- 134 ,(1999) , 10.1145/312129.312216