The Prague Dependency Treebank

作者: Alena Böhmová , Jan Hajič , Eva Hajičová , Barbora Hladká

DOI: 10.1007/978-94-010-0201-1_7

关键词: Dependency (UML)Syntax (programming languages)Natural language processingScheme (programming language)AnnotationCzechTreebankArtificial intelligenceField (computer science)Computational linguisticsComputer science

摘要: The availability of annotated data (with as rich and “deep” annotation possible) is desirable in any new developments. Textual are being used for so-called training phase various empirical methods solving problems the field computational linguistics. While there many that use texts their plain (or raw) form (in most cases unsupervised training), more accurate results may be obtained if corpora available. itself a complex task. morphologically (pioneered by Henry Kucera 60’s) now available English other languages, syntactically rare. Inspired Penn Treebank, widely corpus English, we decided to develop similarly sized Czech with scheme.

参考文章(8)
Petr Sgall, Eva Hajicova, Jarmila Panevova, Language resources need annotations to make them really reusable: the Prague dependency tree bank language resources and evaluation. pp. 713- 718 ,(1998)
Československá akademie věd, Petr Sgall, Generativní popis jazyka a česká deklinace Academia. ,(1967)
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556
Jan Hajic, Barbora Hladká, None, Probabilistic and Rule-Based Tagger of an Inflective Language- a Comparison conference on applied natural language processing. pp. 111- 118 ,(1997) , 10.3115/974557.974574
Mitchell Marcus, Grace Kim, Mary Ann Marcinkiewicz, Robert MacIntyre, Ann Bies, Mark Ferguson, Karen Katz, Britta Schasberger, The Penn Treebank Proceedings of the workshop on Human Language Technology - HLT '94. pp. 114- 119 ,(1994) , 10.3115/1075812.1075835
Michael John Collins, A new statistical parser based on bigram lexical dependencies Proceedings of the 34th annual meeting on Association for Computational Linguistics -. pp. 184- 191 ,(1996) , 10.3115/981863.981888
Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang, Zhao-Ming Gao, Sinica Treebank Treebanks. pp. 231- 248 ,(2003) , 10.1007/978-94-010-0201-1_13