ROBUST RECOGNITION OF SMALL -VOCABULARY TELEPHONE - QUALITY SPEECH

作者: Victor Croitoru , Mihai Sima , Dragos Burileanu , Cristian Negrescu

DOI:

关键词:

摘要: Considerable progress has been made in the field of automatic speech recognition recent years, especially for high-quality (full bandwidth and noise-free) speech. However, good accuracy is difficult to achieve when incoming passed through a telephone channel. At same time, task over lines growing importance, as number applications spoken language processing involving increases every day. The paper presents our work on developing robust speaker-independent isolated-spoken word system based hybrid approach (classic – artificial neural network). A experiments are described compared order evaluate different analysis techniques that best suited telephone-speech task. In particular, we address use RASTA (i.e., filtering temporal trajectories parameters) increasing accuracy. Also, propose method adaptive filter theory producing simulated data starting from clean databases.

参考文章(17)
Daniele Falavigna, Roberto Gretter, Marco Orlandi, Tarcisio Coianiz, Use of simulated data for robust telephone speech recognition. conference of the international speech communication association. ,(1999)
Alex Acero, Xuedong Huang, Hsiao-Wuen Hon, Spoken Language Processing Prentice-Hall. pp. 1008- ,(2001)
Woei-Chyang Shieh, Sen-Chia Chang, The dependence of feature vectors under adverse noise. conference of the international speech communication association. ,(1999)
S Haykin, Adaptive Filter Theory ,(1986)
John L. Hennessy, David A. Patterson, Computer Architecture: A Quantitative Approach ,(1989)
Lawrence Rabiner, Biing-Hwang Juang, Fundamentals of speech recognition ,(1993)
Alex Acero, Raj Reddy, Xuedong Huang, Hsiao-Wuen Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development ,(2001)
Hervé Bourlard, R. Boite, T. Dutoit, H. Leich, J. Hancq, Traitement de la Parole Presses Polytechniques Universitaires Romandes. ,(2000)
J. de Veth, L. Boves, Comparison of channel normalisation techniques for automatic speech recognition over the phone international conference on spoken language processing. ,vol. 4, pp. 2332- 2335 ,(1996) , 10.1109/ICSLP.1996.607275
Hervé Bourlard, Hynek Hermansky, Nelson Morgan, Towards increasing speech recognition error rates Speech Communication. ,vol. 18, pp. 205- 231 ,(1996) , 10.1016/0167-6393(96)00003-9