作者: Wei Zhang , Tinggui Chen
DOI: 10.1007/978-3-642-28658-2_47
关键词: Data warehouse 、 Data transformation 、 Data integration 、 Data cleansing 、 Data pre-processing 、 Data stream mining 、 Transformation (function) 、 Computer science 、 Data mining 、 Data reduction
摘要: Data preprocessing includes data cleaning, integration, transformation and reduction. cleaning is aimed to remove unrelated or redundant items through two processes. integration three main problems each of them can be solved by kinds methods. generalization property construction standardization. Three algorithms used normalize the data. The last step reduction compress in order improve quality mining models. All these four steps are interrelated other shouldn’t separated. They work together final result mining.