Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank

作者: Marie Mikulová , Jan Stepánek

DOI:

关键词:

摘要: In this paper, we present several ways to measure and evaluate the annotation annotators, proposed used during building of Czech part Prague Czech-English Dependency Treebank. At first, basic principles treebank project are introduced (division three layers: morphological, analytical tectogrammatical). The main paper describes in detail one important phases process: evaluation annotators - inter-annotator agreement, error rate performance. measuring agreement is complicated by fact that data contain added deleted nodes, making alignment between annotations non-trivial. measured a set automatic checking procedures guard validity some invariants data. performance booking web application. All measures later compared related each other.

参考文章(2)
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556
Václav Klimeš, Analytical and Tectogrammatical Analysis of a Natural Language Univerzita Karlova, Matematicko-fyzikální fakulta. ,(2006)