Methods for evaluating and creating data quality

作者: William E. Winkler

DOI: 10.1016/J.IS.2003.12.003

关键词:

摘要: This paper provides a survey of two classes methods that can be used in determining and improving the quality individual files or groups files. The first are edit/imputation for maintaining business rules imputing missing data. second data cleaning finding duplicates within across

参考文章(53)
Thomas F. Petkunas, William E. Winkler, THE DISCRETE EDIT SYSTEM ,(1997)
Monica Scannapieco, Paola Bertolazzi, Luca De Santis, Automatic Record Matching in Cooperative Information Systems ,(2002)
A. Blanton Godfrey, Thomas C. Redman, Data Quality For The Information Age ,(1997)
David Loshin, Enterprise knowledge management: the data quality approach Morgan Kaufmann Publishers Inc.. ,(2000)
James Franklin, The elements of statistical learning : data mining, inference,and prediction The Mathematical Intelligencer. ,vol. 27, pp. 83- 85 ,(2005) , 10.1007/BF02985802
Rohit Ananthakrishna, Surajit Chaudhuri, Venkatesh Ganti, Eliminating fuzzy duplicates in data warehouses very large data bases. pp. 586- 597 ,(2002) , 10.1016/B978-155860869-6/50058-5