Towards Robust Speech Analysis

作者: Jean-Claude Junqua , Jean-Paul Haton

DOI: 10.1007/978-1-4613-1297-0_7

关键词:

摘要: This chapter focuses on robust speech acquisition and analysis. It begins by reviewing methods that have been shown to provide important noise reduction improved quality at the first processing stage of a recognizer, namely level. Then, three auditory models, especially successful in conditions, are detailed. Together with their description, application ASR is also summarized. Finally, field spectral estimation ARMA modeling outlined.

参考文章(70)
Stephanie Seneff, A joint synchrony/mean-rate model of auditory speech processing Journal of Phonetics. ,vol. 16, pp. 101- 111 ,(1990) , 10.1016/S0095-4470(19)30466-8
Noboru Sugamura, Yoshio Nakadai, A speech recognition method for noise environments using dual inputs. conference of the international speech communication association. ,(1990)
Shaoyan Chen, Yuqing Gao, Jean Paul Haton, Taiyi Huang, Auditory model based speech processing. conference of the international speech communication association. ,(1992)
Quan fen Guan, Ren-Hua Wang, Hiroya Fujisaki, A method for robust GARMA analysis of speech. conference of the international speech communication association. ,(1990)
Yuqing Gao, Jean Paul Haton, Noise reduction and speech recognition in noise conditions tested on LPNN-based continuous speech recognition system. conference of the international speech communication association. ,(1993)
A. Basu, K.K. Paliwal, A comparative performance evaluation of adaptive ARMA spectral estimation methods for noisy speech international conference on acoustics speech and signal processing. pp. 691- 694 ,(1988) , 10.1109/ICASSP.1988.196680
G. Powell, P. Darlington, P. Wheeler, Practical adaptive noise reduction in the aircraft cockpit environment international conference on acoustics, speech, and signal processing. ,vol. 12, pp. 173- 176 ,(1987) , 10.1109/ICASSP.1987.1169639
M. Hunt, C. Lefebvre, Speech recognition using a cochlear model international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 1979- 1982 ,(1986) , 10.1109/ICASSP.1986.1168651
O. Ghitza, Speech analysis/Synthesis based on matching the synthesized and the original representations in the auditory nerve level international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 1995- 1998 ,(1986) , 10.1109/ICASSP.1986.1169191
Jae Lim, Estimation of LPC coefficients from speech waveforms degraded by additive random noise ICASSP '78. IEEE International Conference on Acoustics, Speech, and Signal Processing. ,vol. 3, pp. 599- 601 ,(1978) , 10.1109/ICASSP.1978.1170570