Partial-match retrieval using indexed descriptor files

作者: John L. Pfaltz , William J. Berman , Edgar M. Cagley

DOI: 10.1145/359007.359013

关键词:

摘要: In this paper we describe a practical method of partial-match retrieval in very large data files. A binary code word, called descriptor, is associated with each record the file. These descriptors are then used to form derived descriptor for block several records, which will serve as an index whole; hence, name “indexed files.”First structure these files described and simple, efficient algorithm presented. Then its expected behavior, terms storage accesses, analyzed detail. Two different file creation procedures sketched, number ways organization can be “tuned” particular application suggested.

参考文章(13)
C.S. Roberts, Partial-match retrieval via the method of superimposed codes Proceedings of the IEEE. ,vol. 67, pp. 1624- 1642 ,(1979) , 10.1109/PROC.1979.11543
D. G. Severance, J. V. Carlis, A Practical Approach to Selecting Record Access Paths ACM Computing Surveys. ,vol. 9, pp. 259- 272 ,(1977) , 10.1145/356707.356709
Robert F. Ling, General considerations on the design of an interactive system for data analysis Communications of The ACM. ,vol. 23, pp. 147- 154 ,(1980) , 10.1145/358826.358828
Oscar Vallarino, On the use of bit maps for multiple key retrieval Proceedings of the 1976 conference on Data : Abstraction, definition and structure -. ,vol. 8, pp. 108- 114 ,(1976) , 10.1145/800237.807128
John R. Files, Harry D. Huskey, An information retrieval system based on superimposed coding Proceedings of the November 18-20, 1969, fall joint computer conference on - AFIPS '69 (Fall). pp. 423- 432 ,(1969) , 10.1145/1478559.1478609
Alfred V. Aho, Jeffrey D. Ullman, Optimal partial-match retrieval when fields are independently specified ACM Transactions on Database Systems. ,vol. 4, pp. 168- 179 ,(1979) , 10.1145/320071.320074
Alfonso F. Cárdenas, Analysis and performance of inverted data base structures Communications of The ACM. ,vol. 18, pp. 253- 263 ,(1975) , 10.1145/360762.360766
David Lefkovitz, The large data base file structure dilemma. Journal of Chemical Information and Computer Sciences. ,vol. 15, pp. 14- 19 ,(1975) , 10.1021/CI60001A005
Ronald L. Rivest, Partial-Match Retrieval Algorithms SIAM Journal on Computing. ,vol. 5, pp. 19- 50 ,(1976) , 10.1137/0205003
Edgar Max Cagley, A retrieval strategy for large, multi-key files requiring frequent updating. A retrieval strategy for large, multi-key files requiring frequent updating.. pp. 48- 48 ,(1971)