DBN versus HMM for Gesture Recognition in Human-Robot Interaction

作者: G. Infantes , I. Ferrané , B. Burger , F. Lerasle

DOI:

关键词:

摘要: Abstract: We designed an easy-to-use user interface based on speech and gesture modalities for controling interactive robot. This paper, after a brief description of this the platform which it is implemented, describes embedded recognition system part multimodal interface. describe two methods, namely Hidden Markov Models Dynamic Bayesian Networks, discuss their relative performance task in our Human-Robot interaction context. The implementation DBN-based outlined some quantitative results are shown. I. INTRODUCTIONSince assistant robots to directly interact with people, finding natural interfaces fundamental importance [1]. Nevertheless, few robotic systems currently equipped completely on-board enabling robot control through communication channels like speech, or both. most advanced one [2] constraint 3D pointing gestures has been developed, but limited mono-manual gestures. In other works, [3] [4], often extracted from monocular images, loosing depth information thus losing capability dealing than directional. With intention providing called Jido such interface, we developed both as well module fusing these results. merging step enables to:− complete underspecified sentence, abbreviation omission, usual human particularly if can be done even used instead− strengthen each modality by improving classification rates commands thanks probabilistic merge results.In framework, paper focuses one- two-handed given video stream delivered stereo head, physical constraints imposed autonomous background: mobility platform, shared computational power, memory capacities, etc.First section background it, leading explanation needs recognition. Next, (HMM) Networks (DBN) task, output visual tracker devoted upper body extremities [5]. Then, outlined. more precisely data clustering process carried out Kohonen network, model training made means Expectation-Maximization algorithm performed using particle filtering [6]. Finally, qualitative symbolic deictic database presented. DBN representation, commonly activity recognition, shown outperform HMM representation especially terms CPU time consuming segmentation.

参考文章(10)
Maria Fox, Malik Ghallab, Guillaume Infantes, Derek Long, Robot introspection through learned hidden Markov models Artificial Intelligence. ,vol. 170, pp. 59- 113 ,(2006) , 10.1016/J.ARTINT.2005.05.007
Philipp W.L. Große, Hartwig Holzapfel, Alex Waibel, Confidence based multimodal fusion for person identification Proceeding of the 16th ACM international conference on Multimedia - MM '08. pp. 885- 888 ,(2008) , 10.1145/1459359.1459513
O. Rogalla, M. Ehrenmann, R. Zollner, R. Becher, R. Dillmann, Using gesture and speech control for commanding a robot assistant robot and human interactive communication. pp. 454- 459 ,(2002) , 10.1109/ROMAN.2002.1045664
Terrence Fong, Illah Nourbakhsh, Kerstin Dautenhahn, A survey of socially interactive robots Robotics and Autonomous Systems. ,vol. 42, pp. 143- 166 ,(2003) , 10.1016/S0921-8890(02)00372-X
M. Yoshizaki, A. Nakamura, Y. Kuno, Mutual assistance between speech and vision for human-robot interface intelligent robots and systems. ,vol. 2, pp. 1308- 1313 ,(2002) , 10.1109/IRDS.2002.1043935
L.R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition Proceedings of the IEEE. ,vol. 77, pp. 267- 296 ,(1989) , 10.1109/5.18626
V.I. Pavlovic, R. Sharma, T.S. Huang, Visual interpretation of hand gestures for human-computer interaction: a review IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 19, pp. 677- 695 ,(1997) , 10.1109/34.598226
B. Burger, B. Burger, B. Burger, F. Lerasle, F. Lerasle, I. Ferrane, I. Ferrane, A. Clodic, Mutual assistance between speech and vision for human-robot interaction intelligent robots and systems. pp. 4011- 4016 ,(2008) , 10.1109/IROS.2008.4651085
Youtian Du, Feng Chen, Wenli Xu, Yongbin Li, Recognizing Interaction Activities using Dynamic Bayesian Network international conference on pattern recognition. ,vol. 1, pp. 618- 621 ,(2006) , 10.1109/ICPR.2006.977
Guillaume Infantes, Malik Ghallab, Félix Ingrand, Learning Behaviors Models for Robot Execution Control. international conference on automated planning and scheduling. pp. 394- 397 ,(2006)