Concept hierarchy based text database categorization in a metasearch engine environment

Authors: W. Wang, W. Meng, C. Yu

DOI: 10.1109/WISE.2000.882403

Abstract: Document categorization, as a technique to improve the retrieval of useful documents, has been extensively investigated. One important issue in a large-scale metasearch engine is to select text databases that are likely to contain useful documents for a given query. We believe that database categorization can be a potentially effective technique for good database selection, especially in the Internet environment, where short queries are usually submitted. In this paper, we propose and evaluate several database categorization algorithms. This study indicates that, while some document categorization algorithms could be adopted for database categorization, algorithms that take into consideration the special characteristics of databases may be more effective. Preliminary experimental results are provided to compare the proposed database categorization algorithms.
