The modulation spectrum in the automatic recognition of speech

作者: H. Hermansky

DOI: 10.1109/ASRU.1997.658998

关键词:

摘要: … Abstract - This work questions the reliability of the short-term spectral envelope as the dominant carrier of the phonetic identity of a given speech instant and suggests the temporal …

参考文章(14)
Hans-Wilhelm Rühl, Hans-Günter Hirsch, Peter Meyer, Improved speech recognition using high-pass filtering of subband envelopes. conference of the international speech communication association. ,(1991)
Hynek Hermansky, Misha Pavel, Takayuki Arai, Noboru Kanedera, On the importance of various modulation frequencies for speech recognition. conference of the international speech communication association. ,(1997)
Hynek Hermansky, Sarel van Vuuren, Data-driven design of RASTA-like filters. conference of the international speech communication association. ,(1997)
T. Arai, M. Pavel, H. Hermansky, C. Avendano, Intelligibility of speech with filtered time trajectories of spectral envelopes international conference on spoken language processing. ,vol. 4, pp. 2490- 2493 ,(1996) , 10.1109/ICSLP.1996.607318
On the properties of temporal processing for speech in adverse environments workshop on applications of signal processing to audio and acoustics. pp. 4- ,(1997) , 10.1109/ASPAA.1997.625589
Rob Drullman, Joost M. Festen, Reinier Plomp, Effect of temporal envelope smearing on speech reception The Journal of the Acoustical Society of America. ,vol. 95, pp. 1053- 1064 ,(1994) , 10.1121/1.408467
Melvyn J. Hunt, A statistical approach to metrics for word and syllable recognition Journal of the Acoustical Society of America. ,vol. 66, ,(1979) , 10.1121/1.2017735
Hynek Hermansky, Should recognizers have ears Speech Communication. ,vol. 25, pp. 3- 27 ,(1998) , 10.1016/S0167-6393(98)00027-2
T. Houtgast, H. J. M. Steeneken, A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria Journal of the Acoustical Society of America. ,vol. 77, pp. 1069- 1077 ,(1985) , 10.1121/1.392224
S. Furui, Cepstral analysis technique for automatic speaker verification IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 29, pp. 254- 272 ,(1981) , 10.1109/TASSP.1981.1163530