System and method for accelerated query evaluation of very large full-text databases

作者: Graham Spencer

DOI:

关键词: tf–idfSoftwareComputer scienceLookup tableMeasure (data warehouse)CacheInformation retrievalTraverseInverted indexData miningTerm (time)

摘要: A system, method, and various software products provide for improved information retrieval in very large document databases through the use of a predetermined static cache. The cache includes terms that appear number documents, plurality documents ordered by contribution term makes to score document. is scalar measure influence computed score. reflects both within frequency between term. In addition, each lookup table references selected entries an inverted index. Queries database are then processed first traversing obtaining thereform computing from this information. Additional other query obtained looking up tables terms, such index, or searching caches terms.

参考文章(4)
Michael Persin, Document filtering for fast ranking international acm sigir conference on research and development in information retrieval. pp. 339- 348 ,(1994) , 10.5555/188490.188597
Wai Yee Peter Wong, Dik Lun Lee, Implementations of partial document ranking using inverted files Information Processing and Management. ,vol. 29, pp. 647- 669 ,(1993) , 10.1016/0306-4573(93)90085-R
Ijsbrand Jan Aalbersberg, A document retrieval model based on term frequency ranks international acm sigir conference on research and development in information retrieval. pp. 163- 172 ,(1994) , 10.5555/188490.188552