The Prague Dependency Treebank

作者： Alena Böhmová , Jan Hajič , Eva Hajičová , Barbora Hladká

关键词: Dependency (UML) 、 Syntax (programming languages) 、 Natural language processing 、 Scheme (programming language) 、 Annotation 、 Czech 、 Treebank 、 Artificial intelligence 、 Field (computer science) 、 Computational linguistics 、 Computer science

摘要: The availability of annotated data (with as rich and “deep” annotation possible) is desirable in any new developments. Textual are being used for so-called training phase various empirical methods solving problems the field computational linguistics. While there many that use texts their plain (or raw) form (in most cases unsupervised training), more accurate results may be obtained if corpora available. itself a complex task. morphologically (pioneered by Henry Kucera 60’s) now available English other languages, syntactically rare. Inspired Penn Treebank, widely corpus English, we decided to develop similarly sized Czech with scheme.

springer.com 本地加速

doi.org 本地加速

sci-hub.st HTML 下载加速

参考文章(8)

Petr Sgall, Eva Hajicova, Jarmila Panevova, Language resources need annotations to make them really reusable: the Prague dependency tree bank language resources and evaluation. pp. 713- 718 ,(1998)

Československá akademie věd, Petr Sgall, Generativní popis jazyka a česká deklinace Academia. ,(1967)

Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556

Jan Hajic, Barbora Hladká, None, Probabilistic and Rule-Based Tagger of an Inflective Language- a Comparison conference on applied natural language processing. pp. 111- 118 ,(1997) , 10.3115/974557.974574

Mitchell Marcus, Grace Kim, Mary Ann Marcinkiewicz, Robert MacIntyre, Ann Bies, Mark Ferguson, Karen Katz, Britta Schasberger, The Penn Treebank Proceedings of the workshop on Human Language Technology - HLT '94. pp. 114- 119 ,(1994) , 10.3115/1075812.1075835

Michael John Collins, A new statistical parser based on bigram lexical dependencies Proceedings of the 34th annual meeting on Association for Computational Linguistics -. pp. 184- 191 ,(1996) , 10.3115/981863.981888

Jan Hajic, Barbora Hladká, None, Tagging inflective languages: prediction of morphological categories for a rich, structured tagset the 36th annual meeting. pp. 483- ,(1998) , 10.3115/980845.980927

Keh-Jiann Chen, Chi-Ching Luo, Ming-Chung Chang, Feng-Yi Chen, Chao-Jan Chen, Chu-Ren Huang, Zhao-Ming Gao, Sinica Treebank Treebanks. pp. 231- 248 ,(2003) , 10.1007/978-94-010-0201-1_13

The Prague Dependency Treebank

来源期刊

我的账户

The Prague Dependency Treebank

来源期刊

相似文章 10

我的账户