Mediating and Metasearching on the Internet.

作者: Yannis Papakonstantinou , Luis Gravano

DOI:

关键词: Computer scienceInformation retrievalSIMPLE (military communications protocol)Ranking (information retrieval)Relational databaseTransparency (human–computer interaction)Interface (Java)The InternetArchitectureOrder (business)

摘要: The Internet emerges as the largest database. Increasingly, users want to issue complex queries across sources obtain data they require. However, finding relevant information and querying them manually is problematic: there are numerous sources, vary in type of objects contain interface present their users. Some text documents support simple query models where a just list keywords. Other more structured provide interfaces style relational languages. Furthermore, have fuse results by merging information, removing redundancies, ranking answer appropriate order, so on. Since it tedious contact several heterogeneous can benefit from metasearchers mediators, which services that with virtual integrated view sources. Users access using unified offers location, model, transparency, i.e., illusion single database do not be aware location Although applications might directly through wrappers, mediators offer an world, related same entity has been fused together, redundancies eliminated, inconsistencies removed. architecture virtually identical (Figure 1). Wrappers export common model each source’s data. also interface. After receiving query, wrapper translates into source-specific or command, hence giving transparency user. Then, underlying source format. To evaluate user over multiple databases, both will typically perform three main tasks:

参考文章(29)
Anand Rajaraman, Jeffrey D. Ullman, Alon Y. Levy, Answering Queries Using Limited External Processors. symposium on principles of database systems. pp. 227- 237 ,(1996)
Guijun Wang, Susan Gauch, Information fusion with ProFusion. WebNet. ,(1996)
Yannis Papakonstantinou, Ashish Gupta, Laura Haas, Capabilities-Based Query Rewriting in Mediator Systems Distributed and Parallel Databases. ,vol. 6, pp. 73- 110 ,(1998) , 10.1023/A:1008646830769
Stanford University. Computer Science Department, Merging Ranks from Heterogeneous Internet Sources very large data bases. pp. 196- 205 ,(1997)
Yannis Papakonstantinou, Vasilis Vassalos, Using Knowledge of Redundancy for Query Optimization in Mediators ,(1998)
Daniela Florescu, Daphne Koller, None, Using Probabilistic Information in Data Integration very large data bases. pp. 216- 225 ,(1997)
Yannis Papakonstantinou, Serge Abiteboul, Hector Garcia-Molina, Object Fusion in Mediator Systems very large data bases. pp. 413- 424 ,(1996)
Ben Johnson-Laird, Ellen M. Voorhees, Narendra Kumar Gupta, The Collection Fusion Problem. text retrieval conference. pp. 95- 104 ,(1994)
Yehoshua Sagiv, Anand Rajaraman, Jeffrey D. Ullman, Answering Queries using Templates with Binding Patterns symposium on principles of database systems. pp. 105- 112 ,(1995)
Oren Etzioni, Erik Selberg, Multi-Engine Search and Comparison Using the MetaCrawler. World Wide Web J.. ,vol. 1, ,(1996)