Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues

Authors: Paul Piwek, Svetlana Stoyanchev

Keywords: Dialogue acts, Computer science, Natural language processing, Paraphrase, Linguistics, Rhetorical question, Artificial intelligence, Corpus linguistics, Annotation, Structure (mathematical logic), Coda

Abstract: We describe the construction of the CODA corpus, a parallel corpus of monologues and expository dialogues. The dialogue part of the corpus consists of expository, i.e., information-delivering rather than dramatic, dialogues written by several acclaimed authors. The monologue part is a paraphrase, in monologue form, of these dialogues by a human annotator. The corpus was constructed as a resource for extracting rules for the automated generation of dialogue from monologue. Using authored dialogues allows us to analyse the techniques used by accomplished writers for presenting information in the form of dialogue. The dialogues are annotated with dialogue acts and the monologues with rhetorical structure. We developed annotation and translation guidelines, together with a custom-developed tool for carrying out translation, alignment and annotation.
