A parallel indexed algorithm for information retrieval

作者: C. Stanfill , R. Thau , D. Waltz

DOI: 10.1145/75334.75345

关键词:

摘要: In this paper we present a parallel document ranking algorithm suitable for use on databases of 1-1000 GB, resident primary or secondary storage. The is based inverted indexes, and has two advantages over previously published retrieval signature files. First, it permits the employment strategies which cannot be easily implemented using files, specifically methods depend document-term weighting. Second, interactive searching evaluated via mixture analytic simulation techniques, with particular focus how cost-effectiveness efficiency change as size database, number processors, cost memory are altered. particular, find that if ratio processors and/or disks to database held constant, then resulting system remains constant. Furthermore, given there optimizes cost-effectiveness. Estimated response times also presented. Using these methods, appears cost-effective access in 100-1000 GB range can achieved current technology.

参考文章(10)
Stone, Parallel Querying of Large Databases: A Case Study IEEE Computer. ,vol. 20, pp. 11- 21 ,(1987) , 10.1109/MC.1987.1663384
Chris Faloutsos, Stavros Christodoulakis, Signature files: an access method for documents and its analytical performance evaluation ACM Transactions on Information Systems. ,vol. 2, pp. 267- 288 ,(1984) , 10.1145/2275.357411
W. Bruce Croft, Pasquale Savino, Implementing ranking strategies using text signatures ACM Transactions on Information Systems. ,vol. 6, pp. 42- 62 ,(1988) , 10.1145/42279.45947
University of Toronto. Computer Systems Research Group, Design Considerations for a Message File Server IEEE Transactions on Software Engineering. ,vol. SE-10, pp. 201- 210 ,(1984) , 10.1109/TSE.1984.5010223
Craig Stanfill, Brewster Kahle, Parallel free-text search on the connection machine system Communications of the ACM. ,vol. 29, pp. 1229- 1239 ,(1986) , 10.1145/7902.7907
Gerard Salton, Chris Buckley, Parallel text search methods Communications of the ACM. ,vol. 31, pp. 202- 215 ,(1988) , 10.1145/42372.42380
W. Daniel Hillis, The Connection Machine ,(1985)
Dennis Tsichritzis, Stavros Christodoulakis, Message files ACM Transactions on Information Systems. ,vol. 1, pp. 88- 98 ,(1983) , 10.1145/357423.357429