An XML-based Representation Format for Syntactically Annotated Corpora.

作者: Andreas Mengel , Wolfgang Lezius

DOI:

关键词:

摘要: This paper discusses a general approach to the description and encoding of linguistic corpora annotated with hierarchically structured syntactic information. A format can be motivated by variety incompatibility existing annotation formats. By using XML as representation theoretical technical problems encountered overcome.

参考文章(8)
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556
Hans Uszkoreit, Thorsten Brants, Brigitte Krenn, Wojciecb Skut, A lingnistically interpreted corpus of German newspaper text language resources and evaluation. pp. 705- 712 ,(1998)
Tim Bray, Jean Paoli, C. M. Sperberg-McQueen, Extensible markup language World Wide Web. ,vol. 2, pp. 29- 66 ,(1997) , 10.5555/274784.273625
Charles F. Goldfarb, The SGML handbook ,(1990)
Andreas Mengel, Ulrich Heid, Query language for access to speech corpora Journal of the Acoustical Society of America. ,vol. 105, pp. 1093- 1093 ,(1999) , 10.1121/1.425122
Esther König, Wolfgang Lezius, A description language for syntactically annotated corpora international conference on computational linguistics. pp. 1056- 1060 ,(2000) , 10.3115/992730.992804
Hans Uszkoreit, Thorsten Brants, Wojciech Skut, Brigitte Krenn, A Linguistically Interpreted Corpus of German Newspaper Text arXiv: Computation and Language. ,(1998)