Pitch and spectral estimation of speech based on auditory synchrony model

作者: S. Seneff

DOI: 10.1109/ICASSP.1984.1172757

关键词:

摘要: This paper describes a system for processing sonorant regions of speech, motivated by knowledge the human auditory system. The spectral representation is intended to reflect proposed model which takes advantage synchrony in nerve firing patterns enhance formant peaks. also applied pitch extraction, and thus temporal processor envisioned. spectrum derived from outputs set linear fillers with critical bandwidths. Saturation adaptation are incorporated each filter independently. Each "spectral" coefficient determined weighting amplitude response at that frequency measure center filter. Pitch front waveform generated adding rectified across dimension. estimator illustrated pure tones natural speech.

参考文章(5)
H. Steven Colburn, Theory of binaural interaction based on auditory‐nerve data. I. General strategy and preliminary results on interaural discrimination The Journal of the Acoustical Society of America. ,vol. 54, pp. 1458- 1470 ,(1973) , 10.1121/1.1914445
Murray B. Sachs, Eric D. Young, Effects of nonlinearities on speech encoding in the auditory nerve Journal of the Acoustical Society of America. ,vol. 68, pp. 858- 875 ,(1979) , 10.1121/1.384825
P. Srulovicz, J. L. Goldstein, A central spectrum model: A synthesis of auditory‐nerve timing and place cues in monaural communication of frequency spectrum Journal of the Acoustical Society of America. ,vol. 73, pp. 1266- 1276 ,(1983) , 10.1121/1.389275
R. L. Smith, J. J. Zwislocki, Short-term adaptation and incremental responses of single auditory-nerve fibers Biological Cybernetics. ,vol. 17, pp. 169- 182 ,(1975) , 10.1007/BF00364166
E. Zwicker, Subdivision of the Audible Frequency Range into Critical Bands (Frequenzgruppen) The Journal of the Acoustical Society of America. ,vol. 33, pp. 248- 248 ,(1961) , 10.1121/1.1908630