Searching XML documents via XML fragments

作者: David Carmel , Yoelle S. Maarek , Matan Mandelbrod , Yosi Mass , Aya Soffer

DOI: 10.1145/860435.860464

关键词:

摘要: Most of the work on XML query and search has stemmed from publishing database communities, mostly for needs business applications. Recently, Information Retrieval community began investigating issue to answer information discovery needs. Following this trend, we present here an approach where can be expressed in approximate manner as pieces documents or "XML fragments" same nature that are being searched. We extension vector space model searching collections via fragments ranking results by relevance. describe how have extended a full-text engine comply with model. The value proposed method is demonstrated relative high precision our system, which was among top performers recent INEX workshop. Our indicate certain queries more appropriate than others Specifically, relatively specific contexts but vague best situated reap benefit Finally show one may not fit all types it could worthwhile use different solutions

参考文章(11)
Yael Petruschka, Yoëlle S. Maarek, Michael Herscovici, David Carmel, Aya Soffer, Einat Amitay, Juru at TREC 10 - Experiments with Index Pruning. text retrieval conference. pp. 228- 236 ,(2001)
Gerard Salton, Michael J. McGill, Introduction to Modern Information Retrieval ,(1983)
Alain Azagury, Michael E. Factor, Yoelle S. Maarek, Benny Mandler, A novel navigation paradigm for XML repositories Journal of the Association for Information Science and Technology. ,vol. 53, pp. 515- 525 ,(2002) , 10.1002/ASI.10062
Ellen M. Voorhees, Chris Buckley, The effect of topic set size on retrieval experiment error Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02. pp. 316- 323 ,(2002) , 10.1145/564376.564432
Gerard Salton, J. Allan, Chris Buckley, Approaches to passage retrieval in full text information systems international acm sigir conference on research and development in information retrieval. pp. 49- 58 ,(1993) , 10.1145/160688.160693
David Carmel, Yoelle Maarek, Aya Soffer, XML and information retrieval ACM SIGIR Forum. ,vol. 34, pp. 31- 36 ,(2000) , 10.1145/373593.373624
Norbert Fuhr, Kai Gross, XIRQL: a query language for information retrieval in XML documents international acm sigir conference on research and development in information retrieval. pp. 172- 180 ,(2001) , 10.1145/383952.383985
Nadav Efraty, Yoelle S. Maarek, Yosi Mass, Gad M. Landau, David Carmel, An Extension of the Vector Space Model for Querying XML Documents via XML Fragments 1 ,(2002)
Andrei Broder, A taxonomy of web search international acm sigir conference on research and development in information retrieval. ,vol. 36, pp. 3- 10 ,(2002) , 10.1145/792550.792552