Integration of Dataset Scans in Processing Sets of Frequent Itemset Queries

作者: Marek Wojciechowski , Maciej Zakrzewicz , Pawel Boinski

DOI: 10.1007/978-3-642-23166-7_9

关键词: Set (abstract data type)Data miningHypergraphDomain (software engineering)Association rule learningComputer scienceA priori and a posterioriArbitrarily largeConstraint (information theory)

摘要: Frequent itemset mining is often regarded as advanced querying where a user specifies the source dataset and pattern constraints using given constraint model. In this chapter we address problem of processing sets frequent queries, which brings ideas multiple-query optimization to domain data mining. The most attractive method solving with respect possible practical applications Common Counting consists in concurrent execution queries Apriori integration scans parts database shared among queries. major advantage over its alternatives applicability arbitrarily large batches If memory structures all be processed by do not fit together main memory, set has partitioned into subsets several phases. We formalize dividing for specific case hypergraph partitioning provide comprehensive overview query algorithms proposed so far.

参考文章(65)
Maciej Zakrzewicz, Marek Wojciechowski, Data Mining Query Scheduling for Apriori Common Counting ,(2004)
Pawel Boinski, Konrad Jozwiak, Marek Wojciechowski, Maciej Zakrzewicz, Improving Quality of Agglomerative Scheduling in Concurrent Processing of Frequent Itemset Queries intelligent information systems. pp. 233- 242 ,(2006) , 10.1007/3-540-33521-8_23
Maciej Zakrzewicz, Marek Wojciechowski, Methods for Batch Processing of Data Mining Queries Proceedings of the Baltic Conference, BalticDB&IS 2002 - Volume 1. pp. 225- 236 ,(2002)
Nebojsa Stefanovic, Yijun Lu, Jiawei Han, Yongjian Fu, Wan Gong, Krzysztof Koperski, Jenny Chiang, Osmar R. Zaiane, Betty Xia, Amynmohamed Rajan, Deyi Li, Wei Wang, DBMiner: a system for mining knowledge in large relational databases knowledge discovery and data mining. pp. 250- 255 ,(1996)
Maciej Zakrzewicz, Marek Wojciechowski, Evaluation of the Mine-Merge Method for Data Mining Query Processing. ADBIS (Local Proceedings). ,(2004)
George Karypis, Vipin Kumar, Multilevel Graph Partitioning Schemes. international conference on parallel processing. pp. 113- 122 ,(1995)
Ramakrishnan Srikant, Rakesh Agrawal, Fast algorithms for mining association rules very large data bases. pp. 580- 592 ,(1998)
Pawel Boinski, Marek Wojciechowski, Maciej Zakrzewicz, A greedy approach to concurrent processing of frequent itemset queries data warehousing and knowledge discovery. pp. 292- 301 ,(2006) , 10.1007/11823728_28
Mikołaj Morzy, Marek Wojciechowski, Maciej Zakrzewicz, Optimizing a Sequence of Frequent Pattern Queries Data Warehousing and Knowledge Discovery. pp. 448- 457 ,(2005) , 10.1007/11546849_44
Marek Wojciechowski, Maciej Zakrzewicz, Dataset Filtering Techniques in Constraint-Based Frequent Pattern Mining Lecture Notes in Computer Science. pp. 77- 91 ,(2002) , 10.1007/3-540-45728-3_7