Adaptive Noisy Clustering

作者: Sébastien Loustau , Michael Chichignoud

DOI:

关键词:

摘要: The problem of adaptive noisy clustering is investigated. Given a set observations $Z_i=X_i+\epsilon_i$, $i=1,...,n$, the goal to design clusters associated with law $X_i$'s, unknown density $f$ respect Lebesgue measure. Since we observe corrupted sample, direct approach as popular {\it $k$-means} not suitable in this case. In paper, propose $k$-means minimization, which based on loss function and deconvolution estimator $f$. particular, suffers from dependence bandwidth involved kernel. Fast rates convergence for excess risk are proposed particular choice bandwidth, depends smoothness Then, turn out into main issue paper: data-driven bandwidth. We state an upper bound new selection rule, called ERC (Empirical Risk Comparison). This rule Lepski's principle, where empirical risks different bandwidths compared. Finally, illustrate that can be used many statistical problems $M$-estimation nuisance parameter.

参考文章(35)
V. I. Koltchinskii, Empirical geometry of multivariate data: a deconvolution approach Annals of Statistics. ,vol. 28, pp. 591- 629 ,(2000) , 10.1214/AOS/1016218232
David Pollard, Strong Consistency of $K$-Means Clustering Annals of Statistics. ,vol. 9, pp. 135- 140 ,(1981) , 10.1214/AOS/1176345339
Sébastien Loustau, Clément Marteau, Minimax fast rates for discriminant analysis with errors in variables arXiv: Statistics Theory. ,(2012) , 10.3150/13-BEJ564
C. Kervrann, J. Boulanger, Optimal Spatial Adaptation for Patch-Based Image Denoising IEEE Transactions on Image Processing. ,vol. 15, pp. 2866- 2878 ,(2006) , 10.1109/TIP.2006.877529
Emanuel Parzen, On Estimation of a Probability Density Function and Mode Annals of Mathematical Statistics. ,vol. 33, pp. 1065- 1076 ,(1962) , 10.1214/AOMS/1177704472
A. Antos, L. Gyorfi, A. Gyorgy, Individual convergence rates in empirical vector quantizer design IEEE Transactions on Information Theory. ,vol. 51, pp. 4013- 4022 ,(2005) , 10.1109/TIT.2005.856976
P.L. Bartlett, T. Linder, G. Lugosi, The minimax distortion redundancy in empirical quantizer design international symposium on information theory. ,vol. 44, pp. 1802- 1813 ,(1997) , 10.1109/18.705560
O. V. Lepskii, On a Problem of Adaptive Estimation in Gaussian White Noise Theory of Probability & Its Applications. ,vol. 35, pp. 454- 466 ,(1991) , 10.1137/1135065
V. Katkovnik, A new method for varying adaptive bandwidth selection IEEE Transactions on Signal Processing. ,vol. 47, pp. 2567- 2571 ,(1999) , 10.1109/78.782208
Olivier Bousquet, A Bennett concentration inequality and its application to suprema of empirical processes Comptes Rendus Mathematique. ,vol. 334, pp. 495- 500 ,(2002) , 10.1016/S1631-073X(02)02292-6