Pattern Recognition with Partly Missing Data

作者: John K. Dixon

DOI: 10.1109/TSMC.1979.4310090

关键词: Missing dataData setPattern recognitionInterpolationData miningArtificial intelligenceBlankComputer scienceData reductionPattern recognition (psychology)General Engineering

摘要: An experimental comparison of several simple inexpensive ways doing pattern recognition when some data elements are missing (blank) is presented. Pattern methods usually designed to deal with perfect data, but in the real world often due error, equipment failure, change plans, etc. Six dealing blanks tested on five sets. Blanks were inserted at random locations into A version K-nearest neighbor technique was used classify and evaluate six methods. Two found be consistently poor. Four generally good. Suggestions given for choosing best method a particular application.

参考文章(9)
C. T. Mong, J. R. Slagle, R. C. T. Lee, Application of clustering to estimate missing data and improve data integrity international conference on software engineering. pp. 539- 544 ,(1976) , 10.5555/800253.807729
C. William Skinner, A heuristic approach to inductive inference in fact retrieval systems Communications of the ACM. ,vol. 17, pp. 707- 712 ,(1974) , 10.1145/361604.361633
R. A. FISHER, THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS Annals of Human Genetics. ,vol. 7, pp. 179- 188 ,(1936) , 10.1111/J.1469-1809.1936.TB02137.X
J.R. Slagle, C.-L. Chang, R.C.T. Lee, Experiments with some cluster analysis algorithms Pattern Recognition. ,vol. 6, pp. 181- 187 ,(1974) , 10.1016/0031-3203(74)90020-X
George M. White, Paul J. Fong, k-Nearest-Neighbor Decision Rule Performance in a Speech Recognition System IEEE Transactions on Systems, Man, and Cybernetics. ,vol. SMC-5, pp. 389- 389 ,(1975) , 10.1109/TSMC.1975.5408420
Michael Hammer, Error detection in data base systems Proceedings of the June 7-10, 1976, national computer conference and exposition on - AFIPS '76. pp. 795- 801 ,(1976) , 10.1145/1499799.1499908
T. Cover, P. Hart, Nearest neighbor pattern classification IEEE Transactions on Information Theory. ,vol. 13, pp. 21- 27 ,(1967) , 10.1109/TIT.1967.1053964
Sahibsingh A. Dudani, The Distance-Weighted k-Nearest-Neighbor Rule IEEE Transactions on Systems, Man, and Cybernetics. ,vol. SMC-6, pp. 325- 327 ,(1976) , 10.1109/TSMC.1976.5408784
Frank M. Andrews, Multiple classification analysis ,(1967)