Evaluating integrated speech- and image understanding

作者: C. Bauckhage , J. Fritsch , K.J. Rohifing , S. Wachsmuth , G. Sagerer

DOI: 10.1109/ICMI.2002.1166961

关键词: MultimediaSpeech processingAutomatic speech processingUsabilityComputer scienceUser interfaceMultimodal interactionRobustness (computer science)SketchImage processing

摘要: The capability to coordinate and interrelate speech vision is a virtual prerequisite for adaptive, cooperative, flexible interaction among people. It therefore assume that human-machine interaction, too, would benefit from intelligent interfaces integrated image processing. In this paper, we first sketch an interactive system integrates automatic processing with understanding. Then, concentrate on performance assessment which believe emerging key issue in multimodal interaction. We explain the of time scale analysis usability studies evaluate our accordingly.

参考文章(23)
Hans Brandt-Pook, Sven Wachsmuth, Gerhard Sagerer, Gernot A. Fink, Integrated Recognition and Interpretaion of Speech for a Construction Task Domain hci international conference. ,vol. 1, pp. 554- ,(1999)
Gerhard Sagerer, Heinrich Niemann, Semantic Networks for Understanding Scenes ,(1997)
Gernot A. Finkco], Developing HMM-Based Recognizers with ESMERALDA text speech and dialogue. pp. 229- 234 ,(1999) , 10.1007/3-540-48239-3_42
Franz Kummert, Gerhard Sagerer, Jannik Fritsch, Christian Bauckhage, Towards a Vision System for Supervising Assembly Processes Proc. Symposium on Intelligent Robotic Systems (SIRS’99). ,(1999)
Takuya Takahashi, Satoru Nakanishi, Yoshinori Kuno, Yoshiaki Shirai, None, Helping computer vision by verbal and nonverbal communication international conference on pattern recognition. ,vol. 2, pp. 1216- 1218 ,(1998) , 10.1109/ICPR.1998.711917
Tom Brøndsted, Kristian Grønborg Olesen, Thomas B. Moeslund, Paul McKevitt, Lars Bo Larsen, Michael Manthey, The Intellimedia WorkBench: a Generic Environment for Multimodal Systems conference of the international speech communication association. pp. 273- 276 ,(1998)
Sven Wachsmuth, Gerhard Sagerer, Gernot A. Fink, Integration of parsing and incremental speech recognition european signal processing conference. ,vol. 1, pp. 1- 4 ,(1998)
Christian Bauckhage, Susanne Kronenberg, Franz Kummert, Gerhard Sagerer, Grammars and Discourse Theory to Describe and Recognize Mechanical Assemblies Lecture Notes in Computer Science. ,vol. 1876, pp. 173- 182 ,(2000) , 10.1007/3-540-44522-6_18
C. Bauckhage, G.A. Fink, J. Fritsch, F. Kummmert, F. Lomker, G. Sagerer, S. Wachsmuth, An integrated system for cooperative man-machine interaction computational intelligence in robotics and automation. pp. 320- 325 ,(2001) , 10.1109/CIRA.2001.1013219
S. Oviatt, R. VanGent, Error resolution during multimodal human-computer interaction international conference on spoken language processing. ,vol. 1, pp. 204- 207 ,(1996) , 10.1109/ICSLP.1996.607077