Properties of Auditory Model Representations.

作者: Fernando Perdigão , Luís Sá

DOI:

关键词: Speech recognitionNeurocomputational speech processingGaussianComputer scienceAcoustic modelRobustness (computer science)Speech processingHair cellLinear filter

摘要: We address the problem of robustness auditory models as front ends for speech recognition. Auditory have been referred superior when is corrupted by noise or linear filtering, but there not yet a deep understanding its functioning. analyze some commonly used and show that they present interesting properties which are useful robust In our view, short-time adaptation provided hair cell key factor this robustness. A disadvantage distributions obtained features well represented gaussian pdfs. discuss parameter transformation in order to use standard recognizer based on CDHMMs with pdfs digit recognition experiments.

参考文章(9)
Stephanie Seneff, A joint synchrony/mean-rate model of auditory speech processing Journal of Phonetics. ,vol. 16, pp. 101- 111 ,(1990) , 10.1016/S0095-4470(19)30466-8
Climent Nadeu, Mónica Gorricho, Javier Hernando, On the decorrelation of filter-bank energies in speech recognition conference of the international speech communication association. pp. 1381- 1384 ,(1995)
Stephen T. Neely, A model of cochlear mechanics with outer hair cell motility Journal of the Acoustical Society of America. ,vol. 94, pp. 137- 146 ,(1993) , 10.1121/1.407091
C.R. Jankowski, H.-D.H. Vo, R.P. Lippmann, A comparison of signal processing front ends for automatic word recognition IEEE Transactions on Speech and Audio Processing. ,vol. 3, pp. 286- 293 ,(1995) , 10.1109/89.397093
Fu-Hua Liu, Pedro J. Moreno, Richard M. Stern, Alejandro Acero, Signal processing for robust speech recognition Proceedings of the workshop on Human Language Technology - HLT '94. pp. 330- 335 ,(1994) , 10.3115/1075812.1075889
Ray Meddis, Simulation of mechanical to neural transduction in the auditory receptor. Journal of the Acoustical Society of America. ,vol. 79, pp. 702- 711 ,(1986) , 10.1121/1.393460
J.-P. Martens, L. Van Immerseel, An auditory model based on the analysis of envelope patterns International Conference on Acoustics, Speech, and Signal Processing. pp. 401- 404 ,(1990) , 10.1109/ICASSP.1990.115713
Kuansan Wang, S. Shamma, Self-normalization and noise-robustness in early auditory representations IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 421- 435 ,(1994) , 10.1109/89.294356
H. Hermansky, N. Morgan, RASTA processing of speech IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 578- 589 ,(1994) , 10.1109/89.326616