Data Preprocessing for Web Data Mining

作者: Wei Zhang , Tinggui Chen

DOI: 10.1007/978-3-642-28658-2_47

关键词: Data warehouseData transformationData integrationData cleansingData pre-processingData stream miningTransformation (function)Computer scienceData miningData reduction

摘要: Data preprocessing includes data cleaning, integration, transformation and reduction. cleaning is aimed to remove unrelated or redundant items through two processes. integration three main problems each of them can be solved by kinds methods. generalization property construction standardization. Three algorithms used normalize the data. The last step reduction compress in order improve quality mining models. All these four steps are interrelated other shouldn’t separated. They work together final result mining.

参考文章(4)
Lizhen Liu, Junjie Chen, Hantao Song, The research of Web mining world congress on intelligent control and automation. ,vol. 3, pp. 2333- 2337 ,(2002) , 10.1109/WCICA.2002.1021507
Micheline Kamber, Jiawei Han, Jian Pei, Data Mining: Concepts and Techniques ,(2000)
Chen Jian, Research on Individualized Information Services Based on Internet Sci/tech Information Development & Economy. ,(2005)