: serialising the ISO SynAF syntactic object model

作者: Laurent Romary , Amir Zeldes , Florian Zipser

DOI: 10.1007/S10579-014-9288-X

关键词:

摘要: This paper introduces , an XML format developed to serialise the object model defined by ISO Syntactic Annotation Framework SynAF. Based on widespread best practices we adapt a popular for syntactic annotation, TigerXML, with additional features support variety of phenomena including constituent and dependency structures, binding, different node types such as compounds or empty elements. We also define interfaces other formats standards Morpho-syntactic MAF ISOCat Data Category Registry. Finally case study German Treebank TueBa-D/Z is presented, showcasing handling topological fields coreference annotation in tandem.

参考文章(25)
David Steinberg, Ed Merks, Marcelo Paternostro, Frank Budinsky, EMF: Eclipse Modeling Framework 2.0 Addison-Wesley Professional. ,(2009)
Britta Schasberger, Robert MacIntyre, Mary Ann Marcinkiewicz, Karen Katz, Ann Bies, Victoria Tredinnick, Mark Ferguson, Grace Kim, Bracketing Guidelines for Treebank II Style ,(2002)
Antonio Pareja-Lora, Amir Zeldes, Kiyong Lee, Éric Villemonte de la Clergerie, Sonja Bosch, Alex Chengyu Fang, Gertrud Faass, Andreas Witt, Laurent Romary, Florian Zipser, S. Choi, [tiger2] As a standardized serialisation for ISO 24615 - SynAF TLT11 - 11th international workshop on Treebanks and Linguistic Theories - 2012. pp. 37- 60 ,(2012)
Linguistic Computing, Lou Burnard, C. M. Sperberg-McQueen, Guidelines for electronic text encoding and interchange Text Encoding Initiative. ,(1994)
Robert MacIntyre, Karen Katz, Ann Bies, Mark Ferguson, Bracketing Guidelines For Treebank II Style Penn Treebank Project ,(1995)
Heike Telljohann, Erhard Hinrichs, Sandra Kübler, None, The Tüba-D/Z Treebank: Annotating German with a Context-Free Backbone language resources and evaluation. ,(2004)
Nancy Ide, Laurent Romary, Encoding Syntactic Annotation Treebanks. pp. 281- 296 ,(2003) , 10.1007/978-94-010-0201-1_16
Laurent Romary, Andreas Witt, Data Formats for Phonological Corpora arXiv: Computation and Language. ,(2014) , 10.1093/OXFORDHB/9780199571932.013.005
Christian Chiarcos, Anke Lüdeling, ANNIS: A Search Tool for Multi-Layer Annotated Corpora Proceedings of the Corpus Linguistics Conference 2009 (CL2009),, 2009, pág. 358. pp. 358- ,(2009) , 10.18452/13437