Using automatically labelled examples to classify rhetorical relations: An assessment

Authors: Caroline Sporleder, Alex Lascarides

DOI: 10.1017/S1351324906004451

Keywords:

Abstract: Being able to identify which rhetorical relations (e.g., contrast or explanation) hold between spans of text is important for many natural language processing applications. Using machine learning to obtain a classifier which can distinguish between different relations typically depends on the availability of manually labelled training data, which is very time-consuming to create. However, rhetorical relations are sometimes lexically marked, i.e., signalled by discourse markers (e.g., because, but, consequently etc.), and it has been suggested (Marcu and Echihabi, 2002) that the presence of these cues in some examples can be exploited to label them automatically with the corresponding relation. The discourse markers are then removed and the automatically labelled data are used to train a classifier to determine relations even when no discourse marker is present (based on other linguistic cues such as word co-occurrences). In this paper, we investigate empirically how feasible this approach is. In particular, we test whether automatically labelled, lexically marked examples are really suitable training material for classifiers that are then applied to unmarked examples. Our results suggest that training on this type of data may not be such a good strategy, as models trained in this way do not seem to generalise very well to unmarked data. Furthermore, we found some evidence that this behaviour is largely independent of the classifiers used and seems to lie in the data itself (e.g., marked and unmarked examples may be too dissimilar linguistically, and removing unambiguous markers in the automatic labelling process may lead to a meaning shift in the examples).
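The automatic labelling step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the marker-to-relation table, the function name `auto_label`, and the rule of skipping sentences with zero or several markers are all assumptions made here for illustration. The sketch finds a single discourse marker inside a sentence, assigns the corresponding relation, and removes the marker so that only the two spans remain as a training example.

```python
import re

# Hypothetical marker-to-relation table (an assumption for illustration;
# the paper's actual marker inventory is not given in this record).
MARKER_TO_RELATION = {
    "because": "explanation",
    "but": "contrast",
    "consequently": "result",
}

# Match any listed marker as a whole word, plus a trailing comma and spaces.
_MARKER_RE = re.compile(
    r"\b(" + "|".join(map(re.escape, MARKER_TO_RELATION)) + r")\b,?\s*",
    flags=re.IGNORECASE,
)


def auto_label(sentence):
    """Return (left_span, right_span, relation), or None if no label applies.

    A sentence is labelled only when it contains exactly one known marker
    with text on both sides; the marker itself is removed from the spans.
    """
    matches = list(_MARKER_RE.finditer(sentence))
    if len(matches) != 1:
        return None  # unmarked, or several markers: skip in this sketch
    match = matches[0]
    left = sentence[:match.start()].strip(" ,;")
    right = sentence[match.end():].strip(" ,;.")
    if not left or not right:
        return None  # marker at the sentence edge: no second span here
    return left, right, MARKER_TO_RELATION[match.group(1).lower()]
```

For example, `auto_label("The match was cancelled because it rained.")` yields the spans "The match was cancelled" and "it rained" with the relation "explanation", while an unmarked sentence yields `None`. A classifier trained on such marker-stripped pairs would then be applied to unmarked examples, which is the setting whose feasibility the paper assesses.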

References (40)
Alex Lascarides, Mirella Lapata. Inferring Sentence-internal Temporal Relations. North American Chapter of the Association for Computational Linguistics, pp. 153-160 (2004)
Simon H. Corston-Oliver. Identifying the Linguistic Correlates of Rhetorical Relations. Discourse Relations and Discourse Markers, pp. 7- (1998)
Daniel Marcu. Improving summarization through rhetorical parsing tuning. Meeting of the Association for Computational Linguistics (1998)
Yuji Matsumoto, Tadashi Nomoto. Learning Discourse Relations with Active Data Selection. Empirical Methods in Natural Language Processing (1999)
W. C. Mann. Rhetorical Structure Theory: A Theory of Text Organization. ISI/RS Report, vol. 87, pp. 2-82 (1987)
Uwe Reyle, Hans Kamp. From Discourse to Logic (1993)
Thiago Alexandre Salgueiro Pardo, Maria das Graças Volpe Nunes, Lucia Helena Machado Rino. DiZer: An Automatic Discourse Analyzer for Brazilian Portuguese. Advances in Artificial Intelligence – SBIA 2004, pp. 224-234 (2004). DOI: 10.1007/978-3-540-28645-5_23
Eugene Charniak. A maximum-entropy-inspired parser. North American Chapter of the Association for Computational Linguistics, pp. 132-139 (2000)
Alex Lascarides, Nicholas Asher. Logics of Conversation (2003)
Daniel C. Marcu, Graeme Hirst. The rhetorical parsing, summarization, and generation of natural language texts. University of Toronto (1998)