Toward a quizmaster robot for speech-based multiparty interaction

Authors: Izaya Nishimuta, Katsutoshi Itoyama, Kazuyoshi Yoshii, Hiroshi G. Okuno

DOI: 10.1080/01691864.2015.1079504

Abstract: This paper presents an interactive quizmaster robot that can manage a multiparty speech-based quiz game. The basic flow of the game is that (1) the quizmaster reads a question, (2) one or more players answer it, and (3) the quizmaster judges the correctness of the answers. We categorize such games into school-type interaction and auction-type interaction. The former asks players to say ‘Yes’ to get the right to answer before answering the question, and the latter allows players to answer the question directly without any advance notice. To realize auction-type interaction, the quizmaster needs the capability of recognizing utterances from multiple people using its own microphones (i.e. ears), even if those utterances are made simultaneously. To cope with such situations, the quizmaster estimates which player made the fastest utterance and recognizes it by localizing and separating the mixture audio signals. Experiments were conducted to evaluate the success rates of identification and speech rec...
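As a rough illustration of the auction-type interaction described in the abstract, the following minimal Python sketch picks the player whose utterance started first and judges that answer. It is not the authors' implementation. It assumes that sound source localization, separation, and speech recognition have already run upstream and produced one record per player. The names `Utterance`, `onset_sec`, `fastest_utterance`, and `judge` are hypothetical, and the exact-match correctness check is a simplifying assumption.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Utterance:
    """One separated utterance (all fields are hypothetical upstream outputs)."""
    player: str       # player label, assumed to come from source localization
    onset_sec: float  # estimated onset time of the separated speech, in seconds
    text: str         # recognition result for the separated speech


def fastest_utterance(utterances: List[Utterance]) -> Optional[Utterance]:
    """Return the utterance with the earliest onset, or None if nobody spoke."""
    if not utterances:
        return None
    return min(utterances, key=lambda u: u.onset_sec)


def judge(utterances: List[Utterance], correct_answer: str) -> Tuple[Optional[str], bool]:
    """Return (fastest player, whether that player's answer matches).

    Matching is a case-insensitive exact comparison, which is an assumption
    made for this sketch only.
    """
    first = fastest_utterance(utterances)
    if first is None:
        return None, False
    is_correct = first.text.strip().lower() == correct_answer.strip().lower()
    return first.player, is_correct


if __name__ == "__main__":
    # Three players answer almost simultaneously; B started first.
    heard = [
        Utterance("A", 1.20, "Kyoto"),
        Utterance("B", 0.95, "Tokyo"),
        Utterance("C", 1.05, "kyoto"),
    ]
    print(judge(heard, "Kyoto"))
```

In a school-type game the same `judge` step would apply, but only after a player has first said ‘Yes’ and been granted the right to answer, so overlapping answers would not need to be resolved in this way.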
