Building Uyghur Dependency Treebank: Design Principles, Annotation Schema and Tools

作者: Mairehaba Aili , Aziguli Xialifu , Maihefureti , Saimaiti Maimaitimin

DOI: 10.1007/978-3-319-31468-6_9

关键词: Artificial intelligenceRepresentation (mathematics)Annotation schemaLanguage GridNatural language processingDesign elements and principlesProcess (engineering)TreebankAnnotationDependency (project management)Computer science

摘要: Treebank is a crucial source of information for NLP and linguistic researches. In this paper, we describe the process building Uyghur dependency treebank, including designing principles, annotation schemas tools corpus creation. The built from public readings corpora, employed multi-tier representation extending future use, created about 23 relations. This paper presents preliminary results project an overview new idea combining with Language Grid.

参考文章(19)
Saso Dzeroski, Petr Pajas, Zdenek Zabokrtský, Tomaz Erjavec, Nina Ledinek, Anreja Zele, Towards a Slovene Dependency Treebank language resources and evaluation. pp. 1388- 1391 ,(2006)
Kemal Oflazer, Bilge Say, Dilek Zeynep Hakkani-Tür, Gökhan Tür, Building a Turkish Treebank Treebanks. pp. 261- 277 ,(2003) , 10.1007/978-94-010-0201-1_15
Vladislav Kubon, Jan Hajic, Martin Cmejrek, Jan Curín, Jirí Havelka, Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation language resources and evaluation. ,(2004)
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556
Anne Abeillé, Lionel Clément, François Toussenel, Building a Treebank for French Treebanks. pp. 165- 187 ,(2003) , 10.1007/978-94-010-0201-1_10
Tuomo Kakkonen, DepAnn - An Annotation Tool for Dependency Treebanks arXiv: Computation and Language. ,(2006)
Sabine Buchholz, Erwin Marsi, CoNLL-X Shared Task on Multilingual Dependency Parsing conference on computational natural language learning. pp. 149- 164 ,(2006) , 10.3115/1596276.1596305
Gülşen Eryiğit, ITU treebank annotation tool Proceedings of the Linguistic Annotation Workshop on - LAW '07. pp. 117- 120 ,(2007) , 10.3115/1642059.1642078
Samat Mamitimin, Turgun Ibrahim, Marhaba Eli, The Annotation Scheme for Uyghur Dependency Treebank international conference on asian language processing. pp. 185- 188 ,(2013) , 10.1109/IALP.2013.56
Igor Boguslavsky, Svetlana Grigorieva, Nikolai Grigoriev, Leonid Kreidlin, Nadezhda Frid, Dependency treebank for Russian: concept, tools, types of information international conference on computational linguistics. pp. 987- 991 ,(2000) , 10.3115/992730.992790