Hashing of Databases with the Use of Metric Properties of the Hamming Space

作者: V. B. Balakirsky

DOI: 10.1093/COMJNL/BXH059

关键词:

摘要: Hashing of databases is considered from the point view information and coding theory. The records a database are represented as binary vectors same length stored in external memory computer. task formulated follows: given pattern fixed size working memory, form set addresses that can disagree with numberof positions smaller than threshold value. We use metric properties Hamming space show computational efforts needed to search for be essentially decreased by using triangle inequality distances between vectors. Furthermore, an introduction Lee distance containing leads new where effectively used.

参考文章(18)
Richard E. Ladner, Kevin C. Zatloukal, Mary Holland Johnson, Nearest neighbor search for data compression. Data Structures, Near Neighbor Searches, and Methodology. pp. 69- 86 ,(1999)
Piotr Indyk, Aristides Gionis, Rajeev Motwani, Similarity Search in High Dimensions via Hashing very large data bases. pp. 518- 529 ,(1999)
Jeffrey S. Vitter, Wen-Chin Chen, Design and Analysis of Coalesced Hashing ,(1986)
Josef Pieprzyk, Babak Sadeghiyan, Design of hashing algorithms ,(1993)
Gaston H. Gonnet, Per-Åke Larson, External hashing with limited internal storage Journal of the ACM. ,vol. 35, pp. 161- 184 ,(1988) , 10.1145/42267.42274
ALVIS BRAZMA, INGE JONASSEN, INGVAR EIDHAMMER, DAVID GILBERT, Approaches to the Automatic Discovery of Patterns in Biosequences Journal of Computational Biology. ,vol. 5, pp. 279- 305 ,(1998) , 10.1089/CMB.1998.5.279
Piotr Indyk, Rajeev Motwani, Prabhakar Raghavan, Santosh Vempala, Locality-preserving hashing in multidimensional spaces symposium on the theory of computing. pp. 618- 625 ,(1997) , 10.1145/258533.258656
Eyal Kushilevitz, Rafail Ostrovsky, Yuval Rabani, Efficient search for approximate nearest neighbor in high dimensional spaces symposium on the theory of computing. pp. 614- 623 ,(1998) , 10.1145/276698.276877
E. N. Gilbert, Capacity of a Burst-Noise Channel Bell System Technical Journal. ,vol. 39, pp. 1253- 1265 ,(1960) , 10.1002/J.1538-7305.1960.TB03959.X
S. Arya, D. M. Mount, O. Narayan, Accounting for boundary effects in nearest neighbor searching Discrete and Computational Geometry. ,vol. 16, pp. 155- 176 ,(1996) , 10.1007/BF02716805