Efficient Mining of High Confidence Association Rules without Support Thresholds

Authors: Jinyan Li, Xiuzhen Zhang, Guozhu Dong, Kotagiri Ramamohanarao, Qun Sun

DOI: 10.1007/978-3-540-48247-5_50

Abstract: Association rules describe the degree of dependence between items in transactional datasets by their confidences. In this paper, we first introduce the problem of mining top rules, namely those association rules with 100% confidence. Traditional approaches to this problem need a minimum support (minsup) threshold and can then discover the top rules with supports ≥ minsup; such approaches, however, rely on minsup to help avoid examining too many candidates, and they miss those top rules whose supports are below minsup. The low-support top rules (e.g. some unusual combinations of factors that have always caused some disease) may be very interesting. Fundamentally different from previous work, our proposed method uses a dataset partitioning technique and two border-based algorithms to efficiently discover all top rules with a given consequent, without the constraint of a support threshold. Importantly, we use borders to concisely represent all top rules instead of enumerating them individually. We also discuss how to discover all zero-confidence rules and some very high (say 90%) confidence rules using approaches similar to those for mining top rules. Experimental results on the Mushroom, Cleveland heart disease, and Boston housing datasets are reported to evaluate the efficiency of the proposed approach.
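
To make the notion of a top rule concrete, the following is a minimal brute-force sketch in Python. It is not the paper's partitioning or border-based method; it simply enumerates antecedents and keeps those whose confidence for a fixed consequent is 100%. The toy transactions, the function names `count` and `top_rules`, and the `minsup_count` parameter are illustrative assumptions, chosen to show how a support threshold can hide a low-support top rule.

```python
from itertools import combinations

# Hypothetical toy transactions (assumed for illustration, not from the paper).
TRANSACTIONS = [
    frozenset({"a", "b", "disease"}),
    frozenset({"a", "c"}),
    frozenset({"b", "c"}),
    frozenset({"a", "c"}),
    frozenset({"b", "c", "disease"}),
    frozenset({"c"}),
]


def count(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def top_rules(transactions, consequent, minsup_count=1):
    """Brute-force search for rules X -> consequent with 100% confidence.

    A rule is kept when every transaction containing X also contains the
    consequent, and the rule's support count is at least `minsup_count`.
    This exhaustive enumeration is exponential in the number of items and
    is only meant to illustrate the definition.
    """
    items = sorted(set().union(*transactions) - {consequent})
    rules = []
    for size in range(1, len(items) + 1):
        for antecedent in combinations(items, size):
            x = frozenset(antecedent)
            n_x = count(x, transactions)
            n_xy = count(x | {consequent}, transactions)
            # confidence = n_xy / n_x; a top rule has confidence exactly 1
            if n_x > 0 and n_xy == n_x and n_xy >= minsup_count:
                rules.append((x, n_xy))
    return rules


if __name__ == "__main__":
    for antecedent, support in top_rules(TRANSACTIONS, "disease"):
        print(sorted(antecedent), "-> disease, support count", support)
    # With a support threshold of 2 transactions the rule above disappears.
    print(top_rules(TRANSACTIONS, "disease", minsup_count=2))
```

In this toy data the only top rule for the consequent `disease` is `{a, b} -> disease`, which holds in a single transaction, so any threshold above one transaction filters it out. That is the kind of rule the abstract argues a minsup-based approach would miss.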

References (11)
Guozhu Dong, Jinyan Li, Xiuzhen Zhang. Discovering Jumping Emerging Patterns and Experiments on Real Datasets. Proceedings of the 9th International Database Conference on Heterogeneous and Internet Databases (1999).
Guozhu Dong, Jinyan Li. Interestingness of Discovered Association Rules in Terms of Neighborhood-Based Unexpectedness. Knowledge Discovery and Data Mining, vol. 1394, pp. 72-86 (1998). DOI: 10.1007/3-540-64383-4_7
Keki B. Irani, Usama M. Fayyad. Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning. International Joint Conference on Artificial Intelligence, vol. 2, pp. 1022-1027 (1993).
Guozhu Dong, Jinyan Li. Efficient Mining of Emerging Patterns: Discovering Trends and Differences. Knowledge Discovery and Data Mining, pp. 43-52 (1999). DOI: 10.1145/312129.312191
C. L. Blake. UCI Repository of Machine Learning Databases. www.ics.uci.edu/~mlearn/MLRepository.html (1998).
Roberto J. Bayardo Jr. Brute-Force Mining of High-Confidence Classification Rules. Knowledge Discovery and Data Mining, pp. 123-126 (1997).
R. Kohavi, G. John, R. Long, D. Manley, K. Pfleger. MLC++: A Machine Learning Library in C++. International Conference on Tools with Artificial Intelligence, pp. 740-743 (1994). DOI: 10.1109/TAI.1994.346412
Rakesh Agrawal, Tomasz Imieliński, Arun Swami. Mining Association Rules between Sets of Items in Large Databases. Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data (SIGMOD '93), vol. 22, pp. 207-216 (1993). DOI: 10.1145/170035.170072
Rakesh Agrawal, Tomasz Imieliński, Arun Swami. Mining Association Rules between Sets of Items in Large Databases. ACM SIGMOD Record, vol. 22, pp. 207-216 (1993). DOI: 10.1145/170036.170072