Automatic integration of Web search interfaces with WISE-Integrator

作者: Hai He , Weiyi Meng , Clement Yu , Zonghuan Wu

DOI: 10.1007/S00778-004-0126-4

关键词:

摘要: An increasing number of databases are becoming Web accessible through form-based search interfaces, and many these sources database-driven e-commerce sites. It is a daunting task for users to access numerous sites individually get the desired information. Hence, providing unified multiple engines selling similar products great importance in allowing compare from with ease. One key such capability integrate interfaces so that user queries can be submitted against integrated interface. Currently, integrating carried out either manually or semiautomatically, which inefficient difficult maintain. In this paper, we present WISE-Integrator - tool performs automatic integration Interfaces Search Engines. explores rich set special metainformation exists uses information identify matching attributes different integration. also resolves domain differences attributes. discuss how automatically extract needed by perform interface Our experimental results, based on 143 real-world four domains, indicate achieve high attribute accuracy produce high-quality without human interactions.

参考文章(22)
Hai He, Weiyi Meng, Clement Yu, Zonghuan Wu, Wise-integrator: an automatic integrator of web search interfaces for E-commerce very large data bases. pp. 357- 368 ,(2003) , 10.1016/B978-012722442-8/50039-2
Gerard Salton, Michael J. McGill, Introduction to Modern Information Retrieval ,(1983)
H. Benetti, D. Beneventano, S. Bergamaschi, F. Guerra, M. Vincini, An information integration framework for e-commerce IEEE Intelligent Systems. ,vol. 17, pp. 18- 25 ,(2002) , 10.1109/5254.988444
William B. Frakes, Ricardo Baeza-Yates, Information Retrieval: Data Structures and Algorithms ,(1992)
Erhard Rahm, Philip A. Bernstein, A survey of approaches to automatic schema matching very large data bases. ,vol. 10, pp. 334- 350 ,(2001) , 10.1007/S007780100057
William W. Cohen, Integration of heterogeneous databases without common domains using queries based on textual similarity Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98. ,vol. 27, pp. 201- 212 ,(1998) , 10.1145/276304.276323
Sun Wu, Udi Manber, Fast text searching: allowing errors Communications of The ACM. ,vol. 35, pp. 83- 91 ,(1992) , 10.1145/135239.135244
Jiying Wang, Fred H. Lochovsky, Data extraction and label assignment for web databases Proceedings of the twelfth international conference on World Wide Web - WWW '03. pp. 187- 196 ,(2003) , 10.1145/775152.775179
Sonia Bergamaschi, Silvana Castano, Maurizio Vincini, Domenico Beneventano, Semantic integration of heterogeneous information sources data and knowledge engineering. ,vol. 36, pp. 215- 249 ,(2001) , 10.1016/S0169-023X(00)00047-1