A joint synchrony/mean-rate model of auditory speech processing

Author: Stephanie Seneff

DOI: 10.1016/S0095-4470(19)30466-8

Abstract: This paper describes a speech processing system that is based on properties of the human auditory system. A bank of critical-band filters defines the initial spectral analysis. Filter outputs are processed by a model of the nonlinear transduction stage in the cochlea, which accounts for such features as saturation, adaptation and forward masking. The parameters of the model were adjusted to match existing experimental results on the physiology of the auditory periphery. The output of this stage is delivered to two parallel channels, each of which produces representations appropriate for distinct subtasks of speech recognition. One path yields an overall energy measure for each channel, which can be identified with the average rate of neural discharge. The outputs of this path appear to be useful for locating acoustic events and assigning segments to broad phonetic categories. In the other path, the extent of dominance of periodicities at each channel's center frequency is captured by a synchrony measure, a representation with enhanced contrast, relative to the mean-rate spectrogram. The outputs of this path show formant peaks during sonorant regions, and smooth transitions over time, as well as preserving prominences in the high-frequency region for fricatives and stops.
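
As a rough illustration of the two parallel paths described in the abstract, the following Python sketch computes a mean-rate value and a synchrony value for a single channel signal. It is not the paper's implementation: the critical-band filterbank and the nonlinear transduction stage are omitted, and the function names (`mean_rate`, `synchrony`), the 20 ms averaging window, and the sum/difference ratio used as the synchrony measure are all assumptions chosen for illustration.

```python
import numpy as np


def mean_rate(x, fs, window_s=0.02):
    """Crude stand-in for the mean-rate path (assumed form).

    Half-wave rectifies the channel signal and averages it over a
    short sliding window, giving an overall energy-like measure.
    """
    n = max(1, int(round(window_s * fs)))
    kernel = np.ones(n) / n
    rectified = np.maximum(x, 0.0)
    return np.convolve(rectified, kernel, mode="same")


def synchrony(x, fs, center_hz, eps=1e-6):
    """Crude stand-in for the synchrony path (assumed form).

    Compares the signal with a copy delayed by one period of the
    channel's center frequency. If the signal is dominated by that
    periodicity, the sum is large and the difference is small, so
    the ratio is high.
    """
    delay = int(round(fs / center_hz))
    current, delayed = x[delay:], x[:-delay]
    numerator = np.mean(np.abs(current + delayed))
    denominator = np.mean(np.abs(current - delayed)) + eps
    return numerator / denominator


# Example: a channel centered at 500 Hz, probed with two pure tones.
fs = 16000
t = np.arange(int(0.2 * fs)) / fs
on_center = np.sin(2 * np.pi * 500 * t)
off_center = np.sin(2 * np.pi * 700 * t)

print("synchrony, tone at center frequency:", synchrony(on_center, fs, 500))
print("synchrony, tone off center frequency:", synchrony(off_center, fs, 500))
print("mean rate near the middle of the tone:", mean_rate(on_center, fs)[len(t) // 2])
```

In this toy setting the mean-rate value tracks signal level regardless of frequency, while the synchrony value is high only when the signal's periodicity matches the channel's center frequency, which is the contrast-enhancing behavior the abstract attributes to the synchrony path.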
