Annotating Search Results from Web Databases

Authors: Yiyao Lu, Hai He, Hongkun Zhao, Weiyi Meng, Clement Yu

DOI: 10.1109/TKDE.2011.175

Abstract: An increasing number of databases have become web accessible through HTML form-based search interfaces. The data units returned from the underlying database are usually encoded into the result pages dynamically for human browsing. For the encoded data units to be machine processable, which is essential for many applications such as deep web data collection and Internet comparison shopping, they need to be extracted out and assigned meaningful labels. In this paper, we present an automatic annotation approach that first aligns the data units on a result page into different groups such that the data in the same group have the same semantic. Then, for each group, we annotate it from different aspects and aggregate the different annotations to predict a final annotation label for it. An annotation wrapper for the search site is automatically constructed and can be used to annotate new result pages from the same web database. Our experiments indicate that the proposed approach is highly effective.

References (35)
Jeff Heflin, James Hendler. Searching the Web with SHOE. Defense Technical Information Center (2000). DOI: 10.21236/ADA440405
Jiying Wang, Ji-Rong Wen, Fred Lochovsky, Wei-Ying Ma. Instance-based schema matching for web databases by domain-specific query probing. Very Large Data Bases, pp. 408-419 (2004). DOI: 10.1016/B978-012088469-8.50038-3
Nicholas Kushmerick, Daniel S. Weld. Wrapper induction for information extraction. International Joint Conference on Artificial Intelligence, pp. 729-737 (1997)
Borislav Popov, Atanas Kiryakov, Angel Kirilov, Dimitar Manov, Damyan Ognyanoff, Miroslav Goranov. KIM: semantic annotation platform. International Semantic Web Conference, pp. 834-849 (2003). DOI: 10.1007/978-3-540-39718-2_53
Dayne Freitag. Multistrategy Learning for Information Extraction. International Conference on Machine Learning, pp. 161-169 (1998)
Zonghuan Wu, Vijay Raghavan, Hua Qian, K.V. Rama, Weiyi Meng, Hai He, C. Yu. Towards automatic incorporation of search engines into a large-scale metasearch engine. Web Intelligence, pp. 658-661 (2003). DOI: 10.1109/WI.2003.1241290
Hai He, Weiyi Meng, Clement Yu, Zonghuan Wu. Constructing interface schemas for search interfaces of web databases. Web Information Systems Engineering, pp. 29-42 (2005). DOI: 10.1007/11581062_3
L. Liu, C. Pu, W. Han. XWRAP: an XML-enabled wrapper construction system for Web information sources. International Conference on Data Engineering, pp. 611-621 (2000). DOI: 10.1109/ICDE.2000.839475
Gerard Salton, Michael J. McGill. Introduction to Modern Information Retrieval (1983)