Prosody-based detection of the context of backchannel responses.

作者: Yasuharu Den , Hiroaki Noguchi

DOI:

关键词:

摘要: ABSTRACT Current spoken dialogue systems lack positive feedback such asbackchannels, which are common in human-human conversa-tions. To develop more natural human-computer interfaces, theinvestigation of backchannel-responses indispensable. In thispaper, we propose a method for detecting the precise timing forbackchannel responses Japanese and aim at incorporating suchmethod future systems. The proposed methodis based on machine learning technique with variety prosodicfeatures. It is shownto be effectivein automatically derivingrulesfor contexts backchannels. performance ofour considerably better than previous methods. 1. INTRODUCTION Many researchers have reported that people hesitate to talk withspokendialogue due positivefeedback fromthe as backchannels, conversations [3, 6]. investigation backchannel-responsemechanisms this paper, amethodfordetecting precisetiming responsesin spo-ken systems.In method, backchannels de-tected by using only prosodic features fundamental fre-quency energy, relatively easy handle currentspeech technology. contrast existing methods, whichuse very limited number hand-made heuristics, weemploy varietyof fea-tures might relevant detection backchannelcontext. will shown our effective automati-cally deriving rules contextsof andthat it performs methods.In Section 2, review related works inJapanese conversation automatic forbackchannels. 3, describe cor-pus used study provide definition backchannels.In 4, conduct psychological experiment order tocategorize negative whichare average humans. 5, obtain, us-ing decision tree cues best dis-criminate InSection 6, summarize paper.

参考文章(8)
Steven L. Salzberg, Alberto Segre, Programs for Machine Learning ,(1994)
Hanae Koiso, Yasuo Horiuchi, Syun Tutiya, Akira Ichikawa, Yasuharu Den, An Analysis of Turn-Taking and Backchannels Based on Prosodic and Syntactic Features in Japanese Map Task Dialogs: Language and Speech. ,vol. 41, pp. 295- 321 ,(1998) , 10.1177/002383099804100404
Anne Johnstone, Umesh Berry, Tina Nguyen, Alan Asper, There was a long pause International Journal of Human-computer Studies \/ International Journal of Man-machine Studies. ,vol. 42, pp. 383- 411 ,(1995) , 10.1006/IJHC.1995.1018
Amy Isard, Gwyneth Doherty-Sneddon, Jacqueline C. Kowtko, Jean Carletta, Anne H. Anderson, Stephen Isard, The reliability of a dialogue structure coding scheme Computational Linguistics. ,vol. 23, pp. 13- 31 ,(1997) , 10.5555/972684.972686
J. Ross Quinlan, C4.5: Programs for Machine Learning ,(1992)
N. Ward, Using prosodic clues to decide when to produce back-channel utterances international conference on spoken language processing. ,vol. 3, pp. 1728- 1731 ,(1996) , 10.1109/ICSLP.1996.607961
Y. Okato, K. Kato, M. Kamamoto, S. Itahashi, Insertion of interjectory response based on prosodic information Proceedings of IVTTA '96. Workshop on Interactive Voice Technology for Telecommunications Applications. pp. 85- 88 ,(1996) , 10.1109/IVTTA.1996.552766