作者: Aravind K. Joshi , Giuseppe Riccardi , Sara Tonelli , Rashmi Prasad
DOI:
关键词: Hierarchy 、 Scheme (programming language) 、 Pragmatics 、 Linguistics 、 Treebank 、 Artificial intelligence 、 Adaptation (computer science) 、 Natural language processing 、 Point (typography) 、 Computer science 、 Domain (software engineering) 、 Annotation
摘要: In this paper, we make a qualitative and quantitative analysis of discourse relations within the LUNA conversational spoken dialog corpus. particular, first describe Penn Discourse Treebank (PDTB) then detail adaptation its annotation scheme to corpus Italian task-oriented dialogs in domain software/hardware assistance. We discuss similarities differences between our approach PDTB paradigm point out peculiarities spontaneous w.r.t. written text, which motivated some changes strategy. introduced non-contiguous arguments modified sense hierarchy order take into account important role pragmatics dialogs. final part present comparison connective frequency representative subset PDTB. Such confirmed two corpora corroborates choice introduce dialog-specific adaptations.