Federated Search

作者: Milad Shokouhi , Luo Si

DOI:

关键词: Federated searchWorld Wide WebSearch engineParallel searchIndex (publishing)Information retrievalDeep WebCollection selectionCrawlingComputer science

摘要: Federated search (federated information retrieval or distributed retrieval) is a technique for searching multiple text collections simultaneously. Queries are submitted to subset of that most likely return relevant answers. The results returned by selected integrated and merged into single list. preferred over centralized alternatives in many environments. For example, commercial engines such as Google cannot easily index uncrawlable hidden web while federated systems can the contents without crawling. In enterprise environments, where each organization maintains an independent engine, techniques provide parallel collections. There three major challenges search. query, documents selected. This creates collection selection problem. To be able select suitable collections, need acquire some knowledge about collection, creating representation from before final presentation user. step result merging problem. The goal this work, comprehensive summary previous research on described above.

参考文章(248)
Milad Shokouhi, Central-rank-based collection selection in uncooperative distributed information retrieval european conference on information retrieval. pp. 160- 172 ,(2007) , 10.1007/978-3-540-71496-5_17
Norbert Fuhr, Optimum Database Selection in Networked IR. Networked Information Retrieval. ,(1996)
David Hawking, Nick Craswell, Paul B. Thistlewaite, Merging Results From Isolated Search Engines. australasian database conference. pp. 189- 200 ,(1999)
Udi Manber, Peter A. Bigot, The search broker usenix symposium on internet technologies and systems. pp. 21- 21 ,(1997)
Garrison W. Cottrell, Christopher Charles Vogt, Adaptive combination of evidence for information retrieval University of California, San Diego. ,(1999)
Henrik Nottelmann, Norbert Fuhr, Decision-Theoretic Resource Selection for Different Data Types in MIND multimedia information retrieval. pp. 43- 57 ,(2003) , 10.1007/978-3-540-24610-7_4
Daryl J. D'Souza, James A. Thom, Collection Selection Using n-Term Indexing. CODAS. pp. 52- 63 ,(1999)
James C. French, Gary A. Monroe, David R. Mikesell, Determining Stopping Criteria in the Generation of Web-Derived Langua ge Models University of Virginia. ,(2000)
Justin Zobel, Daryl J. D'Souza, James A. Thom, Is CORI Effective for Collection Selection? An Exploration of Parameters, Queries, and Data. australasian document computing symposium. pp. 41- 46 ,(2004)
Justin Zobel, Collection Selection via Lexicon Inspection australasian document computing symposium. ,(1997)