System for using silence in speech recognition

Author: Li Jiang

DOI: 10.1121/1.1536510

Keywords:

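Abstract: A system (60) for recognizing speech based on an input data stream indicative of the speech provides possible words represented by the input data stream as a prefix tree (88) including a plurality of phoneme branches connected at nodes. The plurality of phoneme branches is bracketed by at least one input silence branch (92) corresponding to a silence phone on an input side of the prefix tree and at least one output silence branch (94, 96, 98) corresponding to a silence phone on an output side of the prefix tree (60). The prefix tree is traversed to obtain a word that is likely represented by the input data stream. The silence phones provided in the prefix tree can vary based on context.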
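
The structure described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the `Node` class, the `sil` label, the function names, and the toy lexicon are all hypothetical. The sketch covers only the data structure (phoneme branches shared by common prefixes, bracketed by a silence branch on the input side and on the output side). It omits acoustic scoring and context-dependent silence phones, and it uses a single output silence branch per word.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

SIL = "sil"  # hypothetical label for the silence phone


@dataclass
class Node:
    """One node of the prefix tree; child branches are keyed by phone label."""
    children: Dict[str, "Node"] = field(default_factory=dict)
    word: Optional[str] = None  # set on the node that completes a word


def build_silence_bracketed_tree(lexicon: Dict[str, List[str]]) -> Node:
    """Build a phoneme prefix tree bracketed by silence branches.

    `lexicon` maps each word to its phoneme sequence (illustrative data only).
    """
    root = Node()
    # Input side: one silence branch leads into the phoneme branches.
    entry = root.children.setdefault(SIL, Node())
    for word, phones in lexicon.items():
        node = entry
        for phone in phones:
            # Words with a common phoneme prefix share the same branches.
            node = node.children.setdefault(phone, Node())
        # Output side: a silence branch closes this word's path.
        # (Homophones would overwrite each other in this simplified sketch.)
        node.children.setdefault(SIL, Node()).word = word
    return root


def lookup(root: Node, phones: List[str]) -> Optional[str]:
    """Follow a phone sequence from the root; return the word or None."""
    node = root
    for phone in phones:
        node = node.children.get(phone)
        if node is None:
            return None
    return node.word


lexicon = {
    "cat": ["k", "ae", "t"],
    "cab": ["k", "ae", "b"],
    "dog": ["d", "ao", "g"],
}
tree = build_silence_bracketed_tree(lexicon)
print(lookup(tree, ["sil", "k", "ae", "t", "sil"]))  # cat
```

In this sketch a word is only returned when the path both starts and ends with the silence phone, which mirrors the bracketing the abstract describes. A real recognizer would score each branch against the input data stream rather than match labels exactly.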
