Video control of speech recognition

作者: Geoffrey W. Peters

DOI:

关键词: Video processingFilter (video)Speech recognitionVoice activity detectionPoint (typography)GestureComputer scienceVideo captureArtificial intelligenceVideo trackingMicrophone arrayComputer vision

摘要: Method and apparatus for using video input to control speech recognition systems is disclosed. In one embodiment, gestures of a user system are detected from input, used turn unit on off. another the position information supplied microphone array point source filter aid in selecting voice that moving about field camera supplying input.

参考文章(7)
o Pfu Limited Kiyono, Hiroyuki c, o Pfu Limited Itoh, Yasunari c, Conduct-along system ,(1997)
Kensuke Uehara, Method and apparatus for inputting a voice through a microphone Journal of the Acoustical Society of America. ,vol. 91, pp. 1196- 1196 ,(1989) , 10.1121/1.402590
H. Wang, P. Chu, Voice source localization for automatic camera pointing system in videoconferencing international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 187- 190 ,(1997) , 10.1109/ICASSP.1997.599595
Ce Wang, M.S. Brandstein, A hybrid real-time face tracking system international conference on acoustics speech and signal processing. ,vol. 6, pp. 3737- 3740 ,(1998) , 10.1109/ICASSP.1998.679696
D.E. Sturim, M.S. Brandstein, H.F. Silverman, Tracking multiple talkers using microphone-array measurements international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 371- 374 ,(1997) , 10.1109/ICASSP.1997.599650