Assessing relevance using automatically translated documents for cross-language information retrieval

作者: Viviane Moreira Orengo

DOI:

关键词: Relevance feedbackForeign languageInformation retrievalNatural language processingRanking (information retrieval)Natural languageConstructed languageCross-language information retrievalArtificial intelligenceMachine translationComputer scienceRelevance (information retrieval)

摘要: This thesis focuses on the Relevance Feedback (RF) process, and scenario considered is that of a Portuguese-English Cross-Language Information Retrieval (CUR) system. CUR deals with retrieval documents in one natural language response to query expressed another language. RF an automatic process for reformulation. The idea behind it users are unlikely produce perfect queries, especially if given just attempt.The aims at improving queryspecification, which will lead more relevant being retrieved. method consists asking user analyse initial sample retrieved judge them relevance. In context, two main questions were posed. first relates user's ability assessing relevance texts foreign language, hand translated into their automatically second question concerns relationship between accuracy participant's judgements improvement achieved through process. In order answer those questions, this work performed experiment Portuguese speakers asked English documents, hand-translated Portuguese, Portuguese. results show machine translation as effective aiding assess relevance. In addition, impact misjudged documents performance overall moderate, varies greatly different topics. This advances existing research by considering carrying out experiments, aspects remained unexplored until now. contributions also include: investigation using new pair; design implementation stemming algorithm Portuguese; several experiments Latent Semantic Indexing contribute data points theory.

参考文章(109)
Joachim Voeller, The Downward Spiral Springer Berlin Heidelberg. pp. 701- 713 ,(1993) , 10.1007/978-3-642-78508-5_67
Martin Braschler, Donna Harman, Carol Peters, Michael Hess, Peter Schäuble, Michael Kluck, The Evaluation of Systems for Cross-language Information Retrieval language resources and evaluation. ,(2000)
Lisa A. Ballesteros, Cross-Language Retrieval via Transitive Translation The Information Retrieval Series. pp. 203- 234 ,(2002) , 10.1007/0-306-47019-5_8
Endong Xun, Changning Huang, Jianfeng Gao, Ming Zhou, Jian-Yun Nie, Yi Su, Jian Zhang, TREC-9 CLIR Experiments at MSRCN. text retrieval conference. ,(2000)
S. T. Dumais, Using LSI for information filtering: TREC-3 experiments text retrieval conference. ,(1995)
J. Scott McCarley, Martin Franz, Arabic Information Retrieval at IBM. text retrieval conference. ,(2002)
Gerard Salton, The SMART retrieval system ,(1971)
Jussi Karlgren, Preben Hansen, Cross-Language Relevance Assessment and Task Context. CLEF (Working Notes). ,(2002)