Toward large scale integration: Building a MetaQuerier over databases on the Web

作者: Kevin Chen Chuan Chang , Zhen Zhang , Bin He

DOI:

关键词:

摘要: The Web has been rapidly “deepened” by myriad searchable databases online, where data are hidden behind query interfaces. Toward large scale integration over this “deep Web,” we have building the MetaQuerier system– for both exploring (to find) and integrating query) on Web. As an interim report, first, paper proposes our goal of Web-scale integration– With its dynamic ad-hoc nature, such mandates source discovery on-thefly translation. Second, present system architecture underlying technology key subsystems in ongoing implementation. Third, discuss “lessons” learned to date, focusing efforts integration, putting individual function together. On one hand, observe that, across subsystems, is itself non-trivial– which presents challenges opportunities beyond isolation. other also there emerge unified insights “holistic integration”– leverage as a unique opportunity information integration.

参考文章(33)
Kevin Chen-Chuan Chang, Zhen Zhang, Bin He, On-the-Fly Constraint Mapping across Web Query Interfaces ,(2004)
Hai He, Weiyi Meng, Clement Yu, Zonghuan Wu, Wise-integrator: an automatic integrator of web search interfaces for E-commerce very large data bases. pp. 357- 368 ,(2003) , 10.1016/B978-012722442-8/50039-2
Nicholas Kushmerick, Daniel S. Weld, Wrapper induction for information extraction international joint conference on artificial intelligence. pp. 729- 737 ,(1997)
Joann J. Ordille, Anand Rajaraman, Alon Y. Levy, Querying Heterogeneous Information Sources Using Source Descriptions very large data bases. pp. 251- 262 ,(1996)
Bin He, Tao Tao, Kevin Chen-Chuan Chang, Organizing structured web sources by query schemas: a clustering approach conference on information and knowledge management. pp. 22- 31 ,(2004) , 10.1145/1031171.1031178
Kevin Chen-Chuan Chang, Bin He, Chengkai Li, Mitesh Patel, Zhen Zhang, Structured databases on the web: observations and implications international conference on management of data. ,vol. 33, pp. 61- 70 ,(2004) , 10.1145/1031570.1031584
Yigal Arens, Craig A. Knoblock, Wei-Min Shen, Query reformulation for dynamic information integration intelligent information systems. ,vol. 6, pp. 99- 130 ,(1996) , 10.1007/BF00122124
Catriel Beeri, Alon Y. Levy, Marie-Christine Rousset, Rewriting queries using views in description logics symposium on principles of database systems. pp. 99- 108 ,(1997) , 10.1145/263661.263673
Alon Y. Halevy, Pedro Domingos, Philip A. Bernstein, Jayant Madhavan, Representing and reasoning about mappings between domain models national conference on artificial intelligence. pp. 80- 86 ,(2002) , 10.5555/777092.777108
Erhard Rahm, Philip A. Bernstein, A survey of approaches to automatic schema matching very large data bases. ,vol. 10, pp. 334- 350 ,(2001) , 10.1007/S007780100057