Method of recognizing coherently spoken words

Authors: Hermann Ney, Andreas Noll

DOI: 10.1121/1.405503

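Abstract: During recognition, speech values derived from samples of the speech signal are compared with reference values, each word of a given vocabulary being represented by a sequence of reference values. The words are then built from phonemes according to a fixed pronouncing lexicon, and the reference values for the phonemes are determined in a learning phase, each phoneme within a word consisting of a number of equal reference values, that number also being determined in the learning phase. To approximate the transitions between phonemes, a phoneme may also consist of three sections of constant reference values. By fixing the number of reference values per phoneme, the duration of the phoneme can be simulated more accurately. Different possibilities are indicated for determining the distance value during recognition.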
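The matching scheme outlined in the abstract can be illustrated with a minimal sketch. It is not the patented method: the squared-Euclidean local distance, the dynamic-programming transition rule (stay, advance by one, or skip one reference value), and all names (`build_word_reference`, `word_distance`, `recognize`, `phoneme_refs`, `phoneme_lengths`) are assumptions made for illustration only. Each phoneme contributes one reference vector repeated a fixed number of times, and the word with the smallest accumulated distance is selected.

```python
import math


def build_word_reference(phonemes, phoneme_refs, phoneme_lengths):
    """Concatenate, for each phoneme of a word, its reference vector repeated
    a fixed number of times (the assumed per-phoneme duration)."""
    sequence = []
    for p in phonemes:
        sequence.extend([phoneme_refs[p]] * phoneme_lengths[p])
    return sequence


def local_distance(a, b):
    """Squared Euclidean distance between a speech value and a reference value
    (an assumed choice; the abstract only says several options exist)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def word_distance(frames, reference):
    """Accumulated distance of the best alignment found by dynamic programming.
    Assumed transitions: stay on the same reference value, advance by one,
    or skip one reference value."""
    if not frames or not reference:
        return math.inf
    m = len(reference)
    prev = [math.inf] * m
    prev[0] = local_distance(frames[0], reference[0])
    for frame in frames[1:]:
        cur = [math.inf] * m
        for j in range(m):
            best = prev[j]
            if j >= 1:
                best = min(best, prev[j - 1])
            if j >= 2:
                best = min(best, prev[j - 2])
            if best < math.inf:
                cur[j] = best + local_distance(frame, reference[j])
        prev = cur
    # The alignment must end on the last reference value of the word.
    return prev[m - 1]


def recognize(frames, lexicon, phoneme_refs, phoneme_lengths):
    """Return the vocabulary word with the smallest accumulated distance,
    together with the distance of every word."""
    scores = {
        word: word_distance(
            frames, build_word_reference(phonemes, phoneme_refs, phoneme_lengths)
        )
        for word, phonemes in lexicon.items()
    }
    return min(scores, key=scores.get), scores


if __name__ == "__main__":
    # Toy one-dimensional data, invented purely for illustration.
    phoneme_refs = {"a": (0.0,), "b": (1.0,)}
    phoneme_lengths = {"a": 2, "b": 2}
    lexicon = {"ab": ["a", "b"], "ba": ["b", "a"]}
    frames = [(0.0,), (0.1,), (0.9,), (1.0,)]
    best, scores = recognize(frames, lexicon, phoneme_refs, phoneme_lengths)
    print(best, scores)
```

Repeating a reference vector a fixed number of times is what ties the model to a phoneme duration in this sketch: a longer phoneme occupies more positions in the word's reference sequence, so the alignment has to spend more frames on it unless it pays for skips.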

References (13)
Masao Watari, Seibi Chiba, Pattern distance calculating equipment (1985)
Rick Parfitt, George M. White, Peter Deng, Ben Warren, Speech recognition system and method (1982)
Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5, pp. 179-190 (1983), 10.1109/TPAMI.1983.4767370
John W. Klovstad, Speech recognition method having noise immunity, Journal of the Acoustical Society of America, vol. 88, p. 593 (1984), 10.1121/1.399877
George Vensko, Apparatus for automatic speech recognition, Journal of the Acoustical Society of America, vol. 85, p. 1812 (1983), 10.1121/1.397922
Tsuneo Nitta, Speech recognition system, Journal of the Acoustical Society of America, vol. 86, p. 454 (1983), 10.1121/1.402877
Cory S. Myers, Continuous speech pattern recognizer, Journal of the Acoustical Society of America, vol. 82, p. 1863 (1981), 10.1121/1.395726
H. Ney, The use of a one-stage dynamic programming algorithm for connected word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, pp. 188-196 (1984), 10.1109/TASSP.1984.1164320
H. Bourlard, Y. Kamp, C. Wellekens, Speaker dependent connected speech recognition via phonetic Markov models, International Conference on Acoustics, Speech, and Signal Processing, vol. 10, pp. 1213-1216 (1985), 10.1109/ICASSP.1985.1168285