Authors: Hamed Talea, Khashayar Yaghmaie
DOI: 10.1109/ICCSN.2011.6014877
Keywords: Feature extraction, Acoustic model, Artificial intelligence, Audio mining, Speech segmentation, Syllable, Speech processing, Image segmentation, Computer science, Voice activity detection, Pattern recognition, Speech recognition
Abstract: Speech recognition techniques that rely on the audio features of speech degrade in performance in noisy environments. Visual speech recognition helps with this by incorporating a visual signal into the recognition process. An automatic speech recognition (ASR) system can be significantly enhanced with additional information from elements such as the movement of the lips, tongue, and teeth. This paper introduces a combined method for lip region extraction and mouth area estimation, which is then used to develop a technique for speech segmentation. Its accuracy is verified by applying it to syllable boundary separation following vowel segmentation in multi-word phrases.