作者: Nikola Ljubeši'c , Darja Fišer , Ozren Kubelka
DOI:
关键词:
摘要: This paper presents an approach to extract translation equivalents from comparable corpora for polysemous nouns. As opposed the standard approaches that build a single context vector all occurrences of given headword, we first disambiguate headword with third-party sense taggers and then separate each headword. Since state-of-the-art word disambiguation tools are still far perfect, also tried improve results by combining assignments provided two different taggers. Evaluation shows outperform baseline (0.473) in settings experimented with, even when using only one tagger, best-performing indeed obtained taking into account intersection both (0.720).