Query- vs. Crawling-based Classification of Searchable Web Databases.

作者: Panagiotis G. Ipeirotis , Mehran Sahami , Luis Gravano

DOI:

关键词:

摘要: The World-Wide Web is one of the main channels through which people currently exchange information. Unfortunately, this information not characterized in a way that would make its semantics readily understandable by computers, complicates building value-added services on top existing An ambitious effort aims to facilitate development such so-called “Semantic Web.” According Berners-Lee et al. [1]:

参考文章(9)
Ora Lassila, Tim Berners-lee, James A. Hendler, The Semantic Web" in Scientific American ,(2001)
William W. Cohen, Fast Effective Rule Induction Machine Learning Proceedings 1995. pp. 115- 123 ,(1995) , 10.1016/B978-1-55860-377-6.50023-2
Jamie Callan, Margaret Connell, Query-based sampling of text databases ACM Transactions on Information Systems. ,vol. 19, pp. 97- 130 ,(2001) , 10.1145/382979.383040
Junghoo Cho, Hector Garcia-Molina, Lawrence Page, Efficient crawling through URL ordering the web conference. ,vol. 30, pp. 161- 172 ,(1998) , 10.1016/S0169-7552(98)00108-1
W. Wang, W. Meng, C. Yu, Concept hierarchy based text database categorization in a metasearch engine environment web information systems engineering. ,vol. 1, pp. 283- 290 ,(2000) , 10.1109/WISE.2000.882403
Guijun Wang, Mario Gomez, Susan Gauch, ProFusion*: Intelligent Fusion from Multiple, Distributed Search Engines 1 Journal of Universal Computer Science. ,vol. 2, pp. 637- 649 ,(1996)
Jamie Callan, Margaret Connell, Aiqun Du, Automatic discovery of language models for text databases ACM SIGMOD Record. ,vol. 28, pp. 479- 490 ,(1999) , 10.1145/304181.304224
R. Dolin, D. Agrawal, E. El Abbadi, Scalable collection summarization and selection acm international conference on digital libraries. pp. 49- 58 ,(1999) , 10.1145/313238.313257
Panagiotis G. Ipeirotis, Luis Gravano, Mehran Sahami, Probe, count, and classify: categorizing hidden web databases international conference on management of data. ,vol. 30, pp. 67- 78 ,(2001) , 10.1145/375663.375671