Optimally Quantized and Smoothed Histograms.

作者: Robert M. Haralick , Mingzhou Song

DOI:

关键词:

摘要: We propose an approach using optimal quantization and smoothing to generate adaptive histograms for multi-class one dimensional data. The discretization of data is in maximizing a quantizer performance measure, defined by combination average log likelihood, entropy correct classification probability. partition found dynamic programming. density each bin obtained technique that can be considered generalized k nearest neighbor estimation algorithm. However, our much more efficient. Experimental results demonstrated the effectiveness optimally quantized smoothed histograms. Even though obtaining takes about quadratic time sample size, histogram efficient use than typical kernel methods. Therefore, are suitable applications with massive set.

参考文章(10)
Ron Kohavi, Mehran Sahami, Error-based and entropy-based discretization of continuous features knowledge discovery and data mining. pp. 114- 119 ,(1996)
W.K. Härdle, D.W. Scott, Smoothing by weighted averaging of rounded points Research Papers in Economics. ,(1992)
Truxton Fulton, Simon Kasif, Steven Salzberg, Efficient Algorithms for Finding Multi-way Splits for Decision Trees Machine Learning Proceedings 1995. pp. 244- 251 ,(1995) , 10.1016/B978-1-55860-377-6.50038-4
Keki B. Irani, Usama M. Fayyad, Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning international joint conference on artificial intelligence. ,vol. 2, pp. 1022- 1027 ,(1993)
Andrew Nobel, Histogram regression estimation using data-dependent partitions Annals of Statistics. ,vol. 24, pp. 1084- 1105 ,(1996) , 10.1214/AOS/1032526958
Jeffrey S. Simonoff, Smoothing Methods in Statistics ,(1996)
Robert M. Haralick, The table look-up rule Communications in Statistics-theory and Methods. ,vol. 5, pp. 1163- 1191 ,(1976) , 10.1080/03610927608827433
Gábor Lugosi, Andrew Nobel, Consistency of Data-driven Histogram Methods for Density Estimation and Classification Annals of Statistics. ,vol. 24, pp. 687- 706 ,(1996) , 10.1214/AOS/1032894460