Authors: Sabine Brants, Stefanie Dipper, Peter Eisenberg, Silvia Hansen-Schirra, Esther König
DOI: 10.1007/S11168-004-7431-3
Keywords:
Abstract: This paper reports on the TIGER Treebank, a corpus of currently 40,000 syntactically annotated German newspaper sentences. We describe what kind of information is encoded in the treebank and introduce the different representation formats that are used for the annotation and exploitation of the treebank. We explain the different methods used for the annotation: interactive annotation, using the tool ANNOTATE, and LFG parsing. Furthermore, we give an account of the annotation scheme used for the TIGER Treebank. This scheme is an extended and improved version of the NEGRA annotation scheme, and we illustrate in detail the linguistic extensions that were made concerning the annotation in the TIGER project. The main differences are concerned with coordination, verb subcategorization, expletives, and proper nouns. In addition, the paper presents the query tool TIGERSearch, which was developed in the project to exploit the treebank in an adequate way. We describe the query language, which was designed to facilitate a simple formulation of complex queries; furthermore, we briefly introduce a graphical user interface for query input. The paper concludes with a summary and some directions for future work.