Ensembles of diverse clustering-based discriminative dependency parsers

作者: Marzieh Razavi

DOI:

关键词:

摘要: Syntactic parsing and dependency in particular are a core component of many Natural Language Processing (NLP) tasks applications. Improvements can help improve machine translation information extraction applications among others. In this thesis, we extend the framework (Koo, Carreras, Collins, 2008) for which uses single clustering method semi-supervised learning. We make use multiple diverse methods to build discriminative models Maximum Spanning Tree (MST) (McDonald, Crammer, Pereira, 2005). All these clustering-based parsers then combined together using novel ensemble model, performs exact inference on shared hypothesis space all parser models. show that significantly improves unlabeled accuracy from 90.82% 92.46% Section 23 Penn Treebank. also significant improvements domain adaptation Switchboard Brown corpora.

参考文章(66)
Yair Al Censor, Stavros A. Zenios, Parallel Optimization: Theory, Algorithms, and Applications ,(1997)
Xavier Carreras, Experiments with a Higher-Order Projective Dependency Parser empirical methods in natural language processing. pp. 957- 961 ,(2007)
Fernando Pereira, Ryan Mcdonald, Discriminative learning and spanning tree algorithms for dependency parsing University of Pennsylvania. ,(2006)
Alex Zamanian, Scott Miller, Jethran Guinness, Name Tagging with Word Clusters and Discriminative Training north american chapter of the association for computational linguistics. pp. 337- 342 ,(2004)
Yuji Matsumoto, Hiroyasu Yamada, Statistical Dependency Analysis with Support Vector Machines international workshop/conference on parsing technologies. pp. 195- 206 ,(2003)
Fernando C. N. Pereira, Ryan T. McDonald, Online Learning of Approximate Dependency Parsing Algorithms. conference of the european chapter of the association for computational linguistics. ,(2006)
Douglas Brent West, Introduction to Graph Theory ,(1995)
Joakim Nivre, Incremental Non-Projective Dependency Parsing north american chapter of the association for computational linguistics. pp. 396- 403 ,(2007)
Joakim Nivre, Ryan McDonald, Characterizing the Errors of Data-Driven Dependency Parsing Models empirical methods in natural language processing. pp. 122- 131 ,(2007)
Aravind Joshi, Liang Huang, Kevin Knight, Statistical syntax-directed translation with extended domain of locality conference of the association for machine translation in the americas. pp. 66- 73 ,(2006)