Tree Distance and Some Other Variants of Evalb

作者: Martin Emms

DOI:

关键词:

摘要: Some alternatives to the standard evalb measures for parser evaluation are considered, principally use of a tree-distance measure, which assigns score linearity and ancestry respecting mapping between trees, in contrast measures, assign span preserving mapping. Additionally, analysis suggests some further variants, concerning different normalisations, portions tree compared whether scores should be micro or macro averaged. The outputs 6 parsing systems on Section 23 Penn Treebank were taken. It is shown that ranking varies as alternative used. For fixed system, it also parses from best worst will vary according measure argued ameliorates problem has been noted over-penalisation attachment errors.

参考文章(2)
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556
Slav Petrov, Leon Barrett, Romain Thibaux, Dan Klein, Learning Accurate, Compact, and Interpretable Tree Annotation meeting of the association for computational linguistics. pp. 433- 440 ,(2006) , 10.3115/1220175.1220230