Audio-visual integration in multimodal communication

Authors: Tsuhan Chen, R. R. Rao

DOI: 10.1109/5.664274

Abstract: We review recent research that examines audio-visual integration in multimodal communication. The topics include the bimodality of human speech, automated lip reading, facial animation, synchronization, joint audio-video coding, and bimodal speaker verification. We also study the enabling technologies for these topics, including automatic facial-feature tracking and audio-to-visual mapping. Recent progress shows that the joint processing of audio and video provides advantages that are not available when the two are processed independently.

References (61)
G. Wolberg, Digital Image Warping, IEEE Computer Society Press (1990)
Kerry P. Green, "The Use of Auditory and Visual Information in Phonetic Perception," Springer, Berlin, Heidelberg, pp. 55–77 (1996), doi:10.1007/978-3-662-13015-5_5
Claude C. Chibelushi, John S. Mason, R. Deravi, "Integration of acoustic and visual speech for speaker recognition," Conference of the International Speech Communication Association (1993)
Eric David Petajan, "Automatic lipreading to enhance speech recognition (speech reading)," University of Illinois at Urbana-Champaign (1984)
Eric Cosatto, Gerasimos Potamianos, Hans Peter Graf, David B. Roe, "Speaker independent audio-visual database for bimodal ASR," Proc. AVSP'97, pp. 65–68 (1997)
Quentin Summerfield, "Some preliminaries to a comprehensive account of audio-visual speech perception," Lawrence Erlbaum Associates, Inc. (1987)
Peter L. Silsbee, Alan C. Bovik, "Medium Vocabulary Audiovisual Speech Recognition," Springer Berlin Heidelberg, pp. 120–123 (1995), doi:10.1007/978-3-642-57745-1_21
Dominic W. Massaro, "Bimodal Speech Perception: A Progress Report," Springer Berlin Heidelberg, pp. 79–101 (1996), doi:10.1007/978-3-662-13015-5_6