Speech recognition apparatus and method

作者: Dukyung Jung

DOI:

关键词: Speech processingGestureSpeech analyticsAcoustic modelComputer scienceVoice activity detectionAudio miningMicrophoneSpeech codingSpeech recognition

摘要: The present specification relates to a speech recognition apparatus and method capable of accurately recognizing the user in an easy convenient manner without having operate start button or like. according embodiments comprises: camera for capturing image; microphone; control unit detecting preset gesture from image, and, if nonlexical word is detected signal which input through microphone point time at was detected, determining after as effective signal; signal.

参考文章(51)
Tetsunori Kobayashi, Masataka Goto, Koji Kitayama, Katunobu Itou, Speech starter: Noise-robust endpoint detection by using filled pauses conference of the international speech communication association. pp. 1237- 1240 ,(2003)
Jae Joon Han, Chang Kyu Choi, Byung In Yoo, Apparatus and method for controlling user interface using sound recognition ,(2012)
Patrick John Ehlen, Brant Jameson Vasilieff, Jay Henry Lieske, System and method for enhancing speech activity detection using facial feature detection ,(2011)
Pasquale DeMaio, Zhengyou Zhang, Clark Nicholson, Using detected visual cues to change computer system operating states ,(2005)
Anton Mikhailov, Steven Osman, Ruxin Chen, Gustavo A. Hernandez-Abrego, Interface with Gaze Detection and Voice Input ,(2011)
Royce A Levien, Richard T Lord, Robert W Lord, Mark A Malamud, John D Rinaldo Jr, Speech recognition adaptation systems based on adaptation data ,(2012)