An Empirical Comparison of Unknown Word Prediction Methods

Authors: Gertjan van Noord, Valia Kordoni, Kostadin Cholakov, Yi Zhang

Abstract: We compare two types of methods which deal with unknown words in the context of computational grammars. Methods of the first type are based on the idea of supertagging and use a tagger to predict lexical descriptions for unknown tokens in a given input. Methods of the second type perform lexical acquisition (LA) which, in this paper, refers to the automatic acquisition of new lexical entries for the lexicon of a given grammar. The methods are compared by the effect their application has on the parsing coverage and accuracy of the GG grammar of German (Crysmann, 2003). In particular, we adapt the LA method of Cholakov and van Noord (2010), which was originally developed for the Dutch Alpino system, to be used with GG. Its impact on a test corpus of newspaper texts is compared with results reported previously on the same corpus using a tagger. Furthermore, in a smaller experiment, we show that the linguistic knowledge LA provides can also be used for sentence realisation.

References (23)
Jun'ichi Tsujii, Takuya Matsuzaki, Yao-zhong Zhang. A Simple Approach for HPSG Supertagging Using Dependency Information. North American Chapter of the Association for Computational Linguistics, pp. 645-648 (2010).
Valia Kordoni, Rebecca Dridan, Jeremy Nicholson, Timothy Baldwin, Yi Zhang. Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German. Language Resources and Evaluation (2008).
Valia Kordoni, Yi Zhang. Automated Deep Lexical Acquisition for Robust Open Texts Processing. Language Resources and Evaluation, pp. 275-280 (2006).
Gertjan van Noord, Kostadin Cholakov. Combining Finite State and Corpus-based Techniques for Unknown Word Prediction. Recent Advances in Natural Language Processing, pp. 60-64 (2009).
Gregor Erbach. Syntactic Processing of Unknown Words. Artificial Intelligence: Methodology, Systems, Applications, pp. 371-381 (1990). DOI: 10.1016/B978-0-444-88771-9.50046-5
Dan Flickinger, Ann A. Copestake. An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG. Language Resources and Evaluation (2000).
Aravind K. Joshi, Srinivas Bangalore. Supertagging: an approach to almost parsing. Computational Linguistics, vol. 25, pp. 237-265 (1999).
Ulrich Callmeier. PET – a platform for experimentation with efficient HPSG processing techniques. Natural Language Engineering, vol. 6, pp. 99-107 (2000). DOI: 10.1017/S1351324900002369
Frederik Fouvry. Lexicon acquisition with a large-coverage unification-based grammar. Conference of the European Chapter of the Association for Computational Linguistics, pp. 87-90 (2003). DOI: 10.3115/1067737.1067755
Adam Kilgarriff, Gregory Grefenstette. Introduction to the special issue on the web as corpus. Computational Linguistics, vol. 29, pp. 333-347 (2003). DOI: 10.1162/089120103322711569