Integration of acoustic information in a large vocabulary word recognizer

作者: V. Gupta , M. Lennig , P. Mermelstein

DOI: 10.1109/ICASSP.1987.1169578

关键词:

摘要: This paper proposes a new way of using vector quantization for improving recognition performance 60,000 word vocabulary speaker-trained isolated recognizer phonemic Markov model approach to speech recognition. We show that we can effectively increase the codebook size by dividing feature into two vectors lower dimensionality, and then quantizing training each separately. For small size, integration results parameter provides significant improvement in as compared entire set together. Even 64, obtained when procedure are quite close those Gaussian distribution vectors.

参考文章(8)
L. Bahl, R. Bakis, P. Cohen, A. Cole, F. Jelinek, B. Lewis, R. Mercer, Recognition results for several experimental acoustic processors ICASSP '79. IEEE International Conference on Acoustics, Speech, and Signal Processing. ,vol. 4, pp. 249- 251 ,(1979) , 10.1109/ICASSP.1979.1170736
A. Poritz, A. Richter, On hidden Markov models in isolated word recognition international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 705- 708 ,(1986) , 10.1109/ICASSP.1986.1169200
Frederick Jelinek, Continuous speech recognition by statistical methods Proceedings of the IEEE. ,vol. 64, pp. 532- 556 ,(1976) , 10.1109/PROC.1976.10159
L. R. Rabiner, B.-H. Juang, S. E. Levinson, M. M. Sondhi, Recognition of Isolated Digits Using Hidden Markov Models With Continuous Mixture Densities AT&T Technical Journal. ,vol. 64, pp. 1211- 1234 ,(1985) , 10.1002/J.1538-7305.1985.TB00272.X
L. R. Rabiner, S. E. Levinson, M. M. Sondhi, On the Application of Vector Quantization and Hidden Markov Models to Speaker-Independent, Isolated Word Recognition Bell System Technical Journal. ,vol. 62, pp. 1075- 1105 ,(1983) , 10.1002/J.1538-7305.1983.TB03115.X
S. Davis, P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 28, pp. 65- 74 ,(1980) , 10.1109/TASSP.1980.1163420
Stephen E Levinson, Lawrence R Rabiner, M Mohan Sondhi, None, An Introduction to the Application of the Theory of Probabilistic Functions of a Markov Process to Automatic Speech Recognition Bell System Technical Journal. ,vol. 62, pp. 1035- 1074 ,(1983) , 10.1002/J.1538-7305.1983.TB03114.X