Incremental Generalization for Mining in a Data Warehousing Environment

Authors: Martin Ester, Rüdiger Wittmann

DOI: 10.1007/BFB0100982

Keywords: Generalization, Base (topology), Cluster analysis, Computer science, Cardinality, Data warehouse, Visualization, Automatic summarization, Relation (database), Data mining

Abstract: On a data warehouse, either manual analyses supported by appropriate visualization tools or (semi-) automatic data mining may be performed, e.g. clustering, classification and summarization. Attribute-oriented generalization is a common method for the task of summarization. Typically, in a data warehouse update operations are collected and applied to the data warehouse periodically. Then, all derived information has to be updated as well. Due to the very large size of the base relations, it is highly desirable to perform these updates incrementally. In this paper, we present algorithms for incremental attribute-oriented generalization with the conflicting goals of good efficiency and minimal overly generalization. The algorithms for incremental insertions and deletions are based on the materialization of a relation at an intermediate generalization level, i.e. the anchor relation. Our experiments demonstrate that incremental generalization can be performed efficiently at a low degree of overly generalization. Furthermore, an optimal cardinality for the sets of updates can be determined experimentally, yielding the best efficiency.
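
The idea of keeping a materialized relation at an intermediate generalization level can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the concept hierarchy `PARENT`, the class `AnchorRelation`, and its methods are hypothetical names chosen for illustration, and the sketch ignores the paper's treatment of overly generalization. It only shows how insertions and deletions can be applied to a small counted relation instead of rescanning the base relation.

```python
from collections import Counter

# Hypothetical concept hierarchy (assumption): each value maps to its parent concept.
PARENT = {
    "Munich": "Bavaria", "Augsburg": "Bavaria", "Stuttgart": "Baden-Wuerttemberg",
    "Bavaria": "Germany", "Baden-Wuerttemberg": "Germany",
    "apple": "fruit", "pear": "fruit", "carrot": "vegetable",
    "fruit": "food", "vegetable": "food",
}


def climb(value, levels):
    """Replace a value by its ancestor `levels` steps up the hierarchy."""
    for _ in range(levels):
        value = PARENT.get(value, value)  # top-level concepts stay unchanged
    return value


def generalize(tup, levels):
    """Generalize every attribute of a tuple by the same number of levels."""
    return tuple(climb(v, levels) for v in tup)


class AnchorRelation:
    """Counted relation materialized at an intermediate generalization level."""

    def __init__(self, anchor_level=1):
        self.anchor_level = anchor_level
        self.counts = Counter()  # generalized tuple -> number of base tuples

    def insert(self, base_tuple):
        self.counts[generalize(base_tuple, self.anchor_level)] += 1

    def delete(self, base_tuple):
        key = generalize(base_tuple, self.anchor_level)
        if self.counts[key] <= 0:
            raise KeyError(f"no base tuple generalizes to {key!r}")
        self.counts[key] -= 1
        if self.counts[key] == 0:
            del self.counts[key]  # drop tuples with no remaining support

    def summary(self, extra_levels):
        """Derive a more general relation from the anchor relation only."""
        out = Counter()
        for tup, n in self.counts.items():
            out[generalize(tup, extra_levels)] += n
        return out


anchor = AnchorRelation(anchor_level=1)
for row in [("Munich", "apple"), ("Augsburg", "pear"), ("Stuttgart", "carrot")]:
    anchor.insert(row)
anchor.delete(("Munich", "apple"))
print(anchor.summary(extra_levels=1))
```

In this sketch each update touches exactly one counted tuple of the anchor relation, and the final summary is recomputed from the anchor relation rather than from the base data.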

References (8)
Ramakrishnan Srikant, Rakesh Agrawal, Fast algorithms for mining association rules. Very Large Data Bases. pp. 580-592 (1998)
Hans-Peter Kriegel, Martin Ester, Jörg Sander, Xiaowei Xu, A density-based algorithm for discovering clusters in large spatial databases with noise. Knowledge Discovery and Data Mining. pp. 226-231 (1996)
Hans-Peter Kriegel, Martin Ester, Xiaowei Xu, A database interface for clustering in large spatial databases. Knowledge Discovery and Data Mining. pp. 94-99 (1995)
D.W. Cheung, Jiawei Han, V.T. Ng, C.Y. Wong, Maintenance of discovered association rules in large databases: an incremental updating technique. Proceedings of the Twelfth International Conference on Data Engineering. pp. 106-114 (1996), 10.1109/ICDE.1996.492094
J. Han, Y. Cai, N. Cercone, Data-driven discovery of quantitative rules in relational databases. IEEE Transactions on Knowledge and Data Engineering. vol. 5, pp. 29-40 (1993), 10.1109/69.204089
Gregory Piatetsky-Shapiro, Usama Fayyad, Padhraic Smyth, Knowledge discovery and data mining: towards a unifying framework. Knowledge Discovery and Data Mining. pp. 82-88 (1996)
Inderpal Singh Mumick, Dallan Quass, Barinderpal Singh Mumick, Maintenance of data cubes and summary tables in a warehouse. International Conference on Management of Data. vol. 26, pp. 100-111 (1997), 10.1145/253260.253277
Nam Huyn, Multiple-view self-maintenance in data warehousing environments. Very Large Data Bases. pp. 26-35 (1997)