Discrete models for data imputation

Author: Renato Bruni

DOI: 10.1016/J.DAM.2004.04.004

Abstract: The paper is concerned with the problem of automatic detection and correction of inconsistent or out-of-range data in a general process of statistical data collection. The proposed approach is able to deal with hierarchical data containing both qualitative and quantitative values. As customary, erroneous data records are detected by formulating a set of rules. Erroneous records should then be corrected by modifying the data as little as possible, while causing minimum perturbation to the original frequency distributions of the data. Such a process is called imputation. By encoding the rules with linear inequalities, we convert imputation problems into integer linear programming problems. The proposed procedure is tested on a real-world case of census data. Results are extremely encouraging from both the computational and the data quality point of view.
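To make the idea of encoding rules as linear inequalities concrete, here is a minimal sketch on a toy record. It is not the paper's model: the fields, values, edit rules, and the names `DOMAINS`, `EDIT_RULES`, `is_consistent`, and `impute` are all hypothetical, the search is plain enumeration rather than an integer programming solver, and the objective counts changed fields only, leaving out the frequency-distribution term the abstract mentions.

```python
from itertools import product

# Hypothetical toy domains, chosen for illustration only.
DOMAINS = {
    "age": ["child", "adult"],
    "marital": ["single", "married"],
    "job": ["none", "employed"],
}

# Each edit rule lists (field, value) pairs that must not all hold at once.
# With 0/1 indicators x[field, value], a rule over k pairs becomes the
# linear inequality: sum of the listed indicators <= k - 1.
EDIT_RULES = [
    [("age", "child"), ("marital", "married")],
    [("age", "child"), ("job", "employed")],
]


def indicators(record):
    """Map a record to its 0/1 indicator variables."""
    return {
        (field, value): int(record[field] == value)
        for field, values in DOMAINS.items()
        for value in values
    }


def is_consistent(record):
    """Check every edit rule in its linear-inequality form."""
    x = indicators(record)
    return all(sum(x[pair] for pair in rule) <= len(rule) - 1 for rule in EDIT_RULES)


def impute(record, weights=None):
    """Return the consistent record with the smallest weighted number of changes.

    Enumeration stands in for an integer programming solver; it is only
    practical for very small domains.
    """
    weights = weights or {field: 1 for field in DOMAINS}
    fields = list(DOMAINS)
    best, best_cost = None, None
    for values in product(*(DOMAINS[f] for f in fields)):
        candidate = dict(zip(fields, values))
        if not is_consistent(candidate):
            continue
        cost = sum(weights[f] for f in fields if candidate[f] != record[f])
        if best_cost is None or cost < best_cost:
            best, best_cost = candidate, cost
    return best, best_cost


erroneous = {"age": "child", "marital": "married", "job": "employed"}
corrected, cost = impute(erroneous)
```

With unit weights the cheapest repair here changes only the `age` field. Raising the weight on `age` (that is, treating it as more reliable) shifts the repair to the other two fields instead, which is the kind of trade-off an integer programming objective would express.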

References (18)
Paolo Nobili, Antonio Sassano. A Separation Routine for the Set Covering Polytope. Integer Programming and Combinatorial Optimization, pp. 201-219 (1992)
Gianluigi Greco, Sergio Greco, Ester Zumpano. A Logic Programming Approach to the Integration, Repairing and Querying of Inconsistent Databases. International Conference on Logic Programming, vol. 2237, pp. 348-364 (2001). DOI: 10.1007/3-540-45635-X_31
J.J. Hox. A review of current software for handling missing data. Kwantitatieve Methoden, vol. 20, pp. 123-138 (1999)
Enrico Franconi, Antonio Laureti Palma, Nicola Leone, Simona Perri, Francesco Scarcello. Census Data Repair: a Challenging Application of Disjunctive Logic Programming. International Conference on Logic Programming, vol. 2250, pp. 561-578 (2001). DOI: 10.1007/3-540-45653-8_39
John Kelly, Lene Mikkelsen. Focus on the recommendations for the 2000 censuses of population and housing in the ECE region. Statistical Journal of the United Nations Economic Commission for Europe, vol. 15, pp. 177-178 (1998). DOI: 10.3233/SJU-1998-15207
Renato Bruni, Antonio Sassano. Errors Detection and Correction in Large Scale Data Collecting. Intelligent Data Analysis, pp. 84-94 (2001). DOI: 10.1007/3-540-44816-0_9
Laurence A. Wolsey, George L. Nemhauser. Integer and Combinatorial Optimization (1988)
Cliff T. Ragsdale, Patrick G. McKeown. On solving the continuous data editing problem. Computers & Operations Research, vol. 23, pp. 263-273 (1996). DOI: 10.1016/0305-0548(96)81769-2