Annotation of Discourse Relations for Conversational Spoken Dialogs

Authors: Aravind K. Joshi, Giuseppe Riccardi, Sara Tonelli, Rashmi Prasad

Keywords: Hierarchy, Scheme (programming language), Pragmatics, Linguistics, Treebank, Artificial intelligence, Adaptation (computer science), Natural language processing, Point (typography), Computer science, Domain (software engineering), Annotation

Abstract: In this paper, we make a qualitative and quantitative analysis of discourse relations within the LUNA conversational spoken dialog corpus. In particular, we first describe the Penn Discourse Treebank (PDTB) and then detail the adaptation of its annotation scheme to the LUNA corpus of Italian task-oriented dialogs in the domain of software/hardware assistance. We discuss similarities and differences between our approach and the PDTB paradigm and point out the peculiarities of spontaneous dialogs w.r.t. written text, which motivated some changes in the annotation strategy. In particular, we introduced the annotation of relations between non-contiguous arguments and modified the sense hierarchy in order to take into account the important role of pragmatics in dialogs. In the final part of the paper, we present a comparison of sense and connective frequency between a representative subset of the LUNA corpus and the PDTB. Such analysis confirmed the differences between the two corpora and corroborates our choice to introduce dialog-specific adaptations.
