A Syntax-first Approach to High-quality Morphological Analysis and Lemma Disambiguation for the TüBa-D/Z Treebank

Authors: Heike Telljohann, Yannick Versley, Kathrin Beck, Erhard Hinrichs

Abstract: Morphological analyses and lemma information are an important auxiliary resource for any treebank, especially for morphologically rich languages, since such information is a useful precondition for any task that needs to link surface forms to semantic interpretation (either through wordnets or distributional measures). In contrast to common practice in parsing, the method used in the TüBa-D/Z treebank uses syntactic information for morphological disambiguation. We argue that this approach has an advantage in the context of treebanking, since many ambiguities in morphology and lemmas can be eliminated given the context.
