What Is the Nearest Neighbor in High Dimensional Spaces?

Authors: Alexander Hinneburg, Charu C. Aggarwal, Daniel A. Keim

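Abstract: Nearest neighbor search in high dimensional spaces is an interesting and important problem which is relevant for a wide variety of novel database applications. As recent results show, however, the problem is a very difficult one, not only with regard to the performance issue but also to the quality issue. In this paper, we discuss the quality issue and identify a new generalized notion of nearest neighbor search as the relevant problem in high dimensional space. In contrast to previous approaches, our new notion of nearest neighbor search does not treat all dimensions equally but uses a quality criterion to select relevant dimensions (projections) with respect to the given query. As an example of a useful quality criterion, we rate how well the data is clustered around the query point within the selected projection. We then propose an efficient and effective algorithm to solve the generalized nearest neighbor problem. Our experiments based on a number of real and synthetic data sets show that our new approach provides new insights into the nature of nearest neighbor search on high dimensional data.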
