Frequent Itemset Discovery.

作者: Marko Salmenkivi

DOI:

关键词:

摘要: Consider the set of all products sold by a supermarket. Assume that owner supermarker is interested in finding out subsets are often purchased together. Each customer transaction stored database, indicating The database can be described as table, whose columns (items), and rows transactions. value specific entry, is, (row, column)-pair, table 1 if corresponding product was transaction, 0 otherwise. task to find itemsets such items frequently occur same row (products together). most important interestingness measure frequent itemset mining support an itemset. It defined fraction contain x∈X. An its exceeds user-specified threshold value. Association rules closely related pattern class. Let R products, r X, Y ⊆ itemsets. Then X→ association rule over r. usually measured support(X→ Y)= support(X ∪ Y), confidence: conf (X → Y) = support(X∪Y,r) support(X,r) . Thus, confidence conditional probability randomly chosen from X also Given thresholds for confidence, given rules, supports confidences exceed thresholds.

参考文章(4)
Bart Goethals, Juho Muhonen, Hannu Toivonen, Mining Non-Derivable Association Rules. siam international conference on data mining. pp. 239- 249 ,(2005)
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Nicolas Pasquier, Yves Bastide, Rafik Taouil, Lotfi Lakhal, Discovering Frequent Closed Itemsets for Association Rules international conference on database theory. ,vol. 1540, pp. 398- 416 ,(1999) , 10.1007/3-540-49257-7_25
M.J. Zaki, Scalable algorithms for association mining IEEE Transactions on Knowledge and Data Engineering. ,vol. 12, pp. 372- 390 ,(2000) , 10.1109/69.846291