Answering the Most Correlated N Association Rules Efficiently

关键词: Computer science 、 Association rule learning 、 Simple (abstract algebra) 、 Tree (data structure) 、 Property (programming) 、 Pruning (decision trees) 、 Data mining 、 Metric (mathematics) 、 Heuristics

摘要: Many algorithms have been proposed for computing association rules using the support-confidence framework. One drawback of this framework is its weakness in expressing notion correlation. We propose an efficient algorithm mining that uses statistical metrics to determine The simple application conventional techniques developed not possible, since functions correlation do meet anti-monotonicity property crucial traditional methods. In paper, we heuristics vertical decomposition a database, pruning unproductive itemsets, and traversing set-enumeration tree itemsets tailored calculation N most significant rules, where can be specified by user. experimentally compared combination these three with previous approach. Our tests confirmed comutational performance improves several orders magnitude.

参考文章(19)

Srinivasan Parthasarathy, Mitsunori Ogihara, Mohammed J Zaki, Wei Li, New algorithms for fast discovery of association rules knowledge discovery and data mining. pp. 283- 286 ,(1997)

Ramakrishnan Srikant, Rakesh Agrawal, Fast algorithms for mining association rules very large data bases. pp. 580- 592 ,(1998)

Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)

Ron Rymon, Search through systematic set enumeration principles of knowledge representation and reasoning. pp. 539- 550 ,(1992)

Charu C. Aggarwal, Philip S. Yu, A new framework for itemset generation symposium on principles of database systems. pp. 18- 24 ,(1998) , 10.1145/275487.275490

Pradeep Shenoy, Jayant R. Haritsa, S. Sudarshan, Gaurav Bhalotia, Mayank Bawa, Devavrat Shah, Turbo-charging vertical mining of large databases international conference on management of data. ,vol. 29, pp. 22- 33 ,(2000) , 10.1145/335191.335376

Shinichi Morishita, Jun Sese, Transversing itemset lattices with statistical metric pruning symposium on principles of database systems. pp. 226- 236 ,(2000) , 10.1145/335168.335226

Jong Soo Park, Ming-Syan Chen, Philip S. Yu, An effective hash-based algorithm for mining association rules international conference on management of data. ,vol. 24, pp. 175- 186 ,(1995) , 10.1145/223784.223813

Sergey Brin, Rajeev Motwani, Jeffrey D. Ullman, Shalom Tsur, Dynamic itemset counting and implication rules for market basket data international conference on management of data. ,vol. 26, pp. 255- 264 ,(1997) , 10.1145/253260.253325

10.

Bing Liu, Wynne Hsu, Yiming Ma, Pruning and summarizing the discovered associations knowledge discovery and data mining. pp. 125- 134 ,(1999) , 10.1145/312129.312216

Answering the Most Correlated N Association Rules Efficiently

来源期刊

我的账户

Answering the Most Correlated N Association Rules Efficiently

来源期刊

相似文章 10

我的账户