R2LSH: A Nearest Neighbor Search Scheme Based on Two-dimensional Projected Spaces

作者: Kejing Lu , Mineichi Kudo

DOI: 10.1109/ICDE48307.2020.00095

关键词:

摘要: Locality sensitive hashing (LSH) is a widely practiced c-approximate nearest neighbor (c-ANN) search algorithm because of its appealing theoretical guarantee and empirical performance. However, available LSH-based solutions do not achieve good balance between cost quality certain limitations in their index structures.In this paper, we propose novel easy-to-implement disk- based method named R2LSH to answer ANN queries highdimensional spaces. In the indexing phase, maps data objects into multiple two-dimensional projected each space, group B+-trees constructed characterize corresponding distribution. query by setting query-centric ball space using dynamic counting technique, efficiently determines candidates returns results with required quality. Rigorous analysis reveals that proposed supports c-ANN for arbitrarily small c ≥ 1 probability guarantee. Extensive experiments on real datasets verify superiority over state-of-the-art methods.

参考文章(2)
H Jégou, M Douze, C Schmid, Product Quantization for Nearest Neighbor Search IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 33, pp. 117- 128 ,(2011) , 10.1109/TPAMI.2010.57
Yingfan Liu, Hong Cheng, Jiangtao Cui, PQBF: I/O-Efficient Approximate Nearest Neighbor Search by Product Quantization conference on information and knowledge management. pp. 667- 676 ,(2017) , 10.1145/3132847.3132901