The Role of Speech Technology in User Perception and Context Acquisition in HRI

作者: Jorge Wuth , Pedro Correa , Tomás Núñez , Matías Saavedra , Néstor Becerra Yoma

DOI: 10.1007/S12369-020-00682-5

关键词:

摘要: The role and relevance of speech synthesis recognition in social robotics is addressed this paper. To increase the generality study, interaction a human being with one two robots when executing tasks was considered. By making use these scenarios, state-of-the-art synthesizer compared non-linguistic utterances (1) from preference (2) perception robots’ capabilities, (3) typed text to input commands regarding user preference, (4) importance knowing context (5) synthetic voice acquire were evaluated. Speech are different technologies but generating understanding should be understood as dimensions same spoken language phenomenon. Also, robot denotes all information about operating conditions completeness status task that executed by robot. Two robotic setups for online experiments built. With first setup, where only employed, our findings indicate that: highly natural preferred over beep-like audio; users also prefer enter rather than typing text; and, has more important effect on perceived robot’s capability possibility voice. analysis presented here suggests interacted single robot, its cue cause anthropomorphization lost while carried out could evaluate better respect task. In experiment second two-robot collaborative testbed employed. When communicated each other sort problems they trying accomplish mission, observed situation distanced position “reflective” perspective dominated. Our results essential successful human–robot collaboration given objective. For purpose, synthesized screen acquisition.

参考文章(75)
Nicole C. Krämer, Astrid von der Pütten, Sabrina Eimler, Human-Agent and Human-Robot Interaction Theory: Similarities to and Differences from Human-Human Interaction Human-Computer Interaction: The Agency Perspective. pp. 215- 240 ,(2012) , 10.1007/978-3-642-25691-2_9
Joseph B. Lyons, Paul R. Havig, Transparency in a Human-Machine Context: Approaches for Fostering Shared Awareness/Intent international conference on virtual augmented and mixed reality. pp. 181- 190 ,(2014) , 10.1007/978-3-319-07458-0_18
Leila Takayama, Perspectives on Agency Interacting with and through Personal Robots Human-Computer Interaction: The Agency Perspective. pp. 195- 214 ,(2012) , 10.1007/978-3-642-25691-2_8
Roger K. Moore, Spoken Language Processing: Time to Look Outside? International Conference on Statistical Language and Speech Processing. pp. 21- 36 ,(2014) , 10.1007/978-3-319-11397-5_2
Graham Cassford, Jack Hollingum, Speech Technology at Work ,(1988)
Shelley Shwu-Ching Young, Yi Hsuan Wang, Jyh-Shing Roger Jang, Exploring perceptions of integrating tangible learning companions in learning English conversation British Journal of Educational Technology. ,vol. 41, pp. 78- ,(2010) , 10.1111/J.1467-8535.2009.00989.X
Bernhard Jung, Stefan Kopp, FlurMax: An Interactive Virtual Agent for Entertaining Visitors in a Hallway intelligent virtual agents. ,vol. 2792, pp. 23- 26 ,(2003) , 10.1007/978-3-540-39396-2_5
Martin Klesen, Stephan Baldes, Michael Kipp, Patrick Gebhard, Peter Rist, Markus Schmitt, Thomas Rist, CrossTalk: An Interactive Installation with Animated Presentation Agents ,(2002)
Hannes Vilhjálmsson, Tim Bickmore, Lee Campbell, Hao Yan, Justine Cassell, Human conversation as a system framework: designing embodied conversational agents Embodied conversational agents. pp. 29- 63 ,(2001)
F. Michaud, A. Duquette, I. Nadeau, Characteristics of mobile robotic toys for children with pervasive developmental disorders systems, man and cybernetics. ,vol. 3, pp. 2938- 2943 ,(2003) , 10.1109/ICSMC.2003.1244338