From outliers to prototypes: Ordering data

作者: Stefan Harmeling , Guido Dornhege , David Tax , Frank Meinecke , Klaus-Robert Müller

DOI: 10.1016/J.NEUCOM.2005.05.015

关键词:

摘要: We propose simple and fast methods based on nearest neighbors that order objects from high-dimensional data sets typical points to untypical points. On the one hand, we show these easy-to-compute orderings allow us detect outliers (i.e. very points) with a performance comparable or better than other often much more sophisticated methods. how use prototypes (very which facilitate exploratory analysis algorithms such as noisy nonlinear dimensionality reduction clustering. Comprehensive experiments demonstrate validity of our approach.

参考文章(39)
Stephen M. Omohundro, Efficient Algorithms with Neural Network Behavior. Complex Systems. ,vol. 1, ,(1987)
Jose Hanson, Nathalie Japkowicz, Casimir Kulikowski, Concept learning in the absence of counterexamples: an autoassociation-based approach to classification Rutgers University. ,(1999) , 10.7282/T3J96B11
D.M.J. Tax, One-class classification TU Delft, Delft University of Technology. ,(2001)
E Eskin, Andrew Arnold, Michael Prerau, Leonid Portnoy, Sal Stolfo, A GEOMETRIC FRAMEWORK FOR UNSUPERVISED ANOMALY DETECTION: DETECTING INTRUSIONS IN UNLABELED DATA APPLICATIONS OF DATA MINING IN COMPUTER SECURITY. pp. 0- 0 ,(2002) , 10.7916/D8D50TQT
Luc Devroye, Gary L. Wise, Detection of Abnormal Behavior Via Nonparametric Estimation of the Support Siam Journal on Applied Mathematics. ,vol. 38, pp. 480- 488 ,(1980) , 10.1137/0138038
Charles E. Metz, Basic principles of ROC analysis Seminars in Nuclear Medicine. ,vol. 8, pp. 283- 298 ,(1978) , 10.1016/S0001-2998(78)80014-2
Vic Barnett, Toby Lewis, Outliers in Statistical Data ,(1978)
Katrien Van Driessen, Peter J. Rousseeuw, A fast algorithm for the minimum covariance determinant estimator Technometrics. ,vol. 41, pp. 212- 223 ,(1999) , 10.2307/1270566