Interestingness measures for association rules based on statistical validity

作者: Izwan Nizal Mohd. Shaharanee , Fedja Hadzic , Tharam S. Dillon

DOI: 10.1016/J.KNOSYS.2010.11.005

关键词:

摘要: Assessing rules with interestingness measures is the pillar of successful application association discovery. However, discovered are normally large in number, some which not considered as interesting or significant for at hand. In this paper, we present a systematic approach to ascertain rules, and provide precise statistical supporting framework. The proposed strategy combines data mining measurement techniques, including redundancy analysis, sampling multivariate discard non- rules. Moreover, consider real world datasets characterized by uniform non-uniform data/items distribution mixture levels throughout data/items. unified framework applied on these demonstrate its effectiveness discarding many redundant non-significant while still preserving high accuracy rule set whole.

参考文章(37)
Nimrod Megiddo, Ramakrishnan Srikant, Discovering predictive association rules knowledge discovery and data mining. pp. 274- 278 ,(1998)
Izwan Nizal Mohd Shaharanee, Fedja Hadzic, Tharam S. Dillon, Interestingness of Association Rules Using Symmetrical Tau and Logistic Regression australasian joint conference on artificial intelligence. ,vol. 5866, pp. 422- 431 ,(2009) , 10.1007/978-3-642-10439-8_43
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Yonatan Aumann, Yehuda Lindell, A Statistical Theory for Quantitative Association Rules intelligent information systems. ,vol. 20, pp. 255- 283 ,(2003) , 10.1023/A:1022812808206
Stephane Lallich, Olivier Teytaud, Elie Prudhomme, Association Rule Interestingness: Measure and Statistical Validation Quality Measures in Data Mining. pp. 251- 275 ,(2007) , 10.1007/978-3-540-44918-8_11
Stephen D. Bay, Michael J. Pazzani, Detecting Group Differences: Mining Contrast Sets Data Mining and Knowledge Discovery. ,vol. 5, pp. 213- 246 ,(2001) , 10.1023/A:1011429418057
T. S. Dillon, T. Hossain, W. Bloomer, M. Witten, Improvements in supervised BRAINNE: A method for symbolic datamining using neural networks Data Mining and Reverse Engineering. pp. 67- 88 ,(1998) , 10.1007/978-0-387-35300-5_4
Stanley Lemeshow, David W. Hosmer, Applied Logistic Regression ,(1989)
Wilhelmiina Hämäläinen, Matti Nykänen, Efficient Discovery of Statistically Significant Association Rules international conference on data mining. pp. 203- 212 ,(2008) , 10.1109/ICDM.2008.144