ABOUT DATABASE SUMMARIZATION

作者: MINYAR SASSI , AMEL GRISSA TOUZI , HABIB OUNELLI , INES AISSA

DOI: 10.1142/S0218488510006453

关键词:

摘要: The summarization system takes a Database (DB) table as input and produces reduced version of this through both rewriting generalization process. resulting provides records with less precision than the original but it is very informative actual DB content. This form can be used for advanced Data Mining processes. Several approaches have been proposed in literature. most recent SaintEtiQ model, initially by Raschia.1 Based on hierarchical conceptual clustering algorithm, builds summary hierarchy from records. In paper, we propose to extend model introducing some optimization processes including: (i) minimization expert risks domain, (iii) building records, (iv) cooperation user giving him summaries different levels.

参考文章(24)
Noureddine Mouaddib, Guillaume Raschia, Régis Saint-Paul, General purpose database summarization very large data bases. pp. 733- 744 ,(2005)
Patrick Bosc, Olivier Pivert, Laurent Ughetto, On Data Summaries Based on Gradual Rules computational intelligence. pp. 512- 521 ,(1999) , 10.1007/3-540-48774-3_56
Laks V.S. Lakshmanan, Jian Pei, Jiawei Han, Quotient cube: how to summarize the semantics of a data cube very large data bases. pp. 778- 789 ,(2002) , 10.1016/B978-155860869-6/50074-3
H. V. Jagadish, Raymond T. Ng, J. Madar, Semantic Compression and Pattern Extraction with Fascicles very large data bases. pp. 186- 198 ,(1999) , 10.14288/1.0051612
C. Combes, N. Meskens, C. Rivat, J.-P. Vandamme, Using KDD Process to Forecast the Duration of Surgery International Journal of Production Economics. ,vol. 112, pp. 279- 293 ,(2008) , 10.1016/J.IJPE.2006.12.068
Haojun Sun, Shengrui Wang, Qingshan Jiang, FCM-Based Model Selection Algorithms for Determining the Number of Clusters Pattern Recognition. ,vol. 37, pp. 2027- 2037 ,(2004) , 10.1016/J.PATCOG.2004.03.012
Laks V. S. Lakshmanan, Jian Pei, Yan Zhao, SOCQET: semantic OLAP with compressed cube and summarization international conference on management of data. pp. 658- 658 ,(2003) , 10.1145/872757.872843
Ronald R. Yager, A new approach to the summarization of data Information Sciences. ,vol. 28, pp. 69- 86 ,(1982) , 10.1016/0020-0255(82)90033-0
G. Raschia, N. Mouaddib, SAINTETIQ: a fuzzy set-based approach to database summarization Fuzzy Sets and Systems. ,vol. 129, pp. 137- 162 ,(2002) , 10.1016/S0165-0114(01)00197-X