Modeling Text Databases

作者: Ricardo Baeza-Yates , Gonzalo Navarro

DOI: 10.1007/0-387-23394-6_1

关键词: Simple (abstract algebra)SpacetimeCover (topology)Computer scienceDatabaseTheoretical modelsOverhead (computing)Text modelingFull text search

摘要: We present a unified view to models for text databases, proving new relations between empirical and theoretical models. A particular case that we cover is the Web. also introduce simple model random queries size of their answers, giving experimental results support them. As an example importance modeling, analyze time space overhead inverted files

参考文章(26)
W. Willinger, V. Paxson, WHERE MATHEMATICS MEETS THE INTERNET Notices of the American Mathematical Society. ,vol. 45, pp. 961- 970 ,(1998)
Samuel DeFazio, Overview of the Full-Text Document Retrieval Benchmark. The Benchmark Handbook. ,(1993)
Paul Barford, Azer Bestavros, Adam Bradley, Mark Crovella, Changes in Web client access patterns: Characteristics and caching implications World Wide Web. ,vol. 2, pp. 15- 28 ,(1999) , 10.1023/A:1019236319752
Udi Manber, Sun Wu, GLIMPSE: a tool to search through entire file systems usenix winter technical conference. pp. 4- 4 ,(1994)
Gonzalo Navarro, Ricardo Baeza-Yates, A New Indexing Method for Approximate String Matching combinatorial pattern matching. pp. 163- 185 ,(1999) , 10.1007/3-540-48452-3_13
r;ribeiro-neto bueza-yates (b), Modern Information Retrieval ,(1999)
Leo Egghe, The Distribution of N-Grams Scientometrics. ,vol. 47, pp. 237- 252 ,(2000) , 10.1023/A:1005634925734
E.S. De Moura, G. Navarro, N. Ziviani, R. Baeza-Yates, Direct pattern matching on compressed text string processing and information retrieval. pp. 90- 95 ,(1998) , 10.1109/SPIRE.1998.712987