Beyond Market Baskets: Generalizing Association Rules to Dependence Rules

作者: Craig Silverstein , Sergey Brin , Rajeev Motwani

DOI: 10.1023/A:1009713703947

关键词: Market basketAssociation rule learningPurchasingEfficient algorithmData miningApriori algorithmSynthetic dataMathematics

摘要: One of the more well-studied problems in data mining is search for association rules market basket data. Association are intended to identify patterns type: “A customer purchasing item A often also purchases B.” Motivated partly by goal generalizing beyond and ironing out some definition rules, we develop notion dependence that statistical both presence absence items itemsets. We propose measuring significance via chi-squared test independence from classical statistics. This leads a measure upward-closed itemset lattice, enabling us reduce problem border between dependent independent itemsets lattice. pruning strategies based on closure property thereby devise an efficient algorithm discovering rules. demonstrate our algorithm‘s effectiveness testing it census data, text (wherein seek term dependence), synthetic

参考文章(25)
Heikki Mannila, A. Inkeri Verkamo, Hannu Toivonen, Efficient algorithms for discovering association rules knowledge discovery and data mining. pp. 181- 192 ,(1994)
M.A.W. Houtsma, A. Swami, Set-Oriented Mining for Association Rules IBM Research Report RJ 9567. ,(1993)
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Ramakrishnan Srikant, Rakesh Agrawal, Mining Generalized Association Rules very large data bases. pp. 407- 419 ,(1995)
Heikki Mannila, A. Inkeri Verkamo, Ramakrishnan Srikant, Hannu Toivonen, Rakesh Agrawal, Fast discovery of association rules knowledge discovery and data mining. pp. 307- 328 ,(1996)
Heikki Mannila, Sanjeev Saluja, Dimitrios Gunopulos, Discovering All Most Specific Sentences by Randomized Algorithms international conference on database theory. pp. 215- 229 ,(1997)
Jiawei Han, Yongjian Fu, Discovery of Multiple-Level Association Rules from Large Databases very large data bases. pp. 420- 431 ,(1995)
Shamkant B. Navathe, Edward Omiecinski, Ashoka Savasere, An Efficient Algorithm for Mining Association Rules in Large Databases very large data bases. pp. 432- 444 ,(1995)
Hannu Toivonen, Sampling Large Databases for Association Rules very large data bases. pp. 134- 145 ,(1996)
William J Frawley, Christopher J Matheus, Knowledge Discovery in Databases MIT Press. ,(1991)