Neural networks for automatic speech recognition: a review

作者: Jean-Paul Haton

DOI: 10.1007/978-1-4471-0845-0_14

关键词:

摘要: Most present automatic speech recognition systems are based on stochastic models, especially hidden Markov models (HMMs). However, during the past ten years, several projects have been directed toward use of a new class models: connectionist artificial neural networks (ANNs).

参考文章(101)
G.J. Gibson, S. Siu, C.F.N. Cowen, Multilayer perceptron structures applied to adaptive equalisers for data communications international conference on acoustics, speech, and signal processing. pp. 1183- 1186 ,(1989) , 10.1109/ICASSP.1989.266645
S. Tamura, A. Waibel, Noise reduction using connectionist models international conference on acoustics speech and signal processing. pp. 553- 556 ,(1988) , 10.1109/ICASSP.1988.196643
E. Singer, R.P. Lippman, A speech recognizer using radial basis function neural networks in an HMM framework international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 629- 632 ,(1992) , 10.1109/ICASSP.1992.225830
E. Tsiang, A neural architecture for computing acoustic-phonetic invariants international conference on acoustics speech and signal processing. ,vol. 2, pp. 1109- 1112 ,(1998) , 10.1109/ICASSP.1998.675463
Yann LeCun, John Denker, Sara Solla, None, Optimal Brain Damage neural information processing systems. ,vol. 2, pp. 598- 605 ,(1989)
A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, K.J. Lang, Phoneme recognition using time-delay neural networks IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 37, pp. 393- 404 ,(1989) , 10.1109/29.21701
Jianxiong Wu, Chorkin Chan, Isolated word recognition by neural network models with cross-correlation coefficients for speech dynamics IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 15, pp. 1174- 1185 ,(1993) , 10.1109/34.244678
A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, K. Lang, Phoneme recognition: neural networks vs. hidden Markov models vs. hidden Markov models international conference on acoustics speech and signal processing. pp. 107- 110 ,(1988) , 10.1109/ICASSP.1988.196523
B. Mak, Combining ANNs to improve phone recognition international conference on acoustics, speech, and signal processing. ,vol. 4, pp. 3253- 3256 ,(1997) , 10.1109/ICASSP.1997.595487
L. Holmstrom, A. Hamalainen, The self-organizing reduced kernel density estimator IEEE International Conference on Neural Networks. pp. 417- 421 ,(1993) , 10.1109/ICNN.1993.298593