Parsing with Compositional Vector Grammars

作者: Richard Socher , Christopher D. Manning , Ng Andrew Y. , John Bauer

DOI:

关键词:

摘要: Natural language parsing has typically been done with small sets of discrete categories such as NP and VP, but this representation does not capture the full syntactic nor semantic richness linguistic phrases, attempts to improve on by lexicalizing phrases or splitting only partly address problem at cost huge feature spaces sparseness. Instead, we introduce a Compositional Vector Grammar (CVG), which combines PCFGs syntactically untied recursive neural network that learns syntactico-semantic, compositional vector representations. The CVG improves PCFG Stanford Parser 3.8% obtain an F1 score 90.4%. It is fast train implemented approximately efficient reranker it about 20% faster than current factored parser. soft notion head words performance types ambiguities require information PP attachments.

参考文章(40)
Slav Petrov, Dan Klein, Improved Inference for Unlexicalized Parsing north american chapter of the association for computational linguistics. pp. 404- 411 ,(2007)
Christopher D. Manning, Michael Collins, Daphne Koller, Ben Taskar, Dan Klein, Max-Margin Parsing empirical methods in natural language processing. pp. 1- 8 ,(2004)
James R. Curran, Jonathan K. Kummerfeld, David Hall, Dan Klein, Parser Showdown at the Wall Street Corral: An Empirical Investigation of Error Types in Parser Output empirical methods in natural language processing. pp. 1048- 1059 ,(2012)
Richard Socher, Andrew Y. Ng, Cliff C. Lin, Chris Manning, Parsing Natural Scenes and Natural Language with Recursive Neural Networks international conference on machine learning. pp. 129- 136 ,(2011)
Eugene Charniak, A maximum-entropy-inspired parser north american chapter of the association for computational linguistics. pp. 132- 139 ,(2000)
Dimitri Kartsaklis, Mehrnoosh Sadrzadeh, Stephen Pulman, A Unified Sentence Space for Categorical Distributional-Compositional Semantics: Theory and Experiments international conference on computational linguistics. pp. 549- 558 ,(2012)
Stuart Shieber, Joshua T. Goodman, Parsing inside-out arXiv: Computation and Language. ,(1998)
P. D. Turney, P. Pantel, From frequency to meaning: vector space models of semantics Journal of Artificial Intelligence Research. ,vol. 37, pp. 141- 188 ,(2010) , 10.1613/JAIR.2934
J. Andrew Bagnell, Martin A. Zinkevich, Nathan D. Ratliff, Online) Subgradient Methods for Structured Prediction ,(2007)
Brody Huval, Richard Socher, Andrew Y. Ng, Christopher D. Manning, Semantic Compositionality through Recursive Matrix-Vector Spaces empirical methods in natural language processing. pp. 1201- 1211 ,(2012)