Flexible vowel recognition by the generation of dynamic coherence in oscillator neural networks: speaker-independent vowel recognition

Authors: Fang Liu, Yoko Yamaguchi, Hiroshi Shimizu

DOI: 10.1007/BF00197313

Keywords: Background noise, Psychology, Artificial neural network, Synchronization, Coherence (signal processing), Vowel, Robustness (computer science), Speech recognition, Formant, Linear prediction

Abstract: We propose a new model for speaker-independent vowel recognition which uses the flexibility of dynamic linking that results from the synchronization of oscillating neural units. The system consists of an input layer and three layers, which are referred to as the A-, B- and C-centers. The input signals are a time series of linear prediction (LPC) spectrum envelopes of auditory signals. At each time-window within the series, the A-center receives the input signals, extracts local peaks of the spectrum envelope, i.e., formants, and encodes them into local groups of independent oscillations. Speaker-independent vowel characteristics are embedded as a connection matrix in the B-center according to statistical data on Japanese vowels. The associative interaction in the B-center and the reciprocal interaction between the A- and B-centers selectively activate a vowel as a global synchronized pattern over the two centers. The C-center evaluates the synchronized activities among the formant regions to give a selective output of the category among the five Japanese vowels. Thus, a flexible ability for dynamical linking among features is achieved. The capability of the present system was investigated for speaker-independent vowel recognition, and the system demonstrated a remarkable ability to recognize vowels, very similar to that of human listeners, including misleading vowels. In addition, it showed stable recognition for unsteady input signals and robustness against background noise. The optimum condition for the frequency of oscillation is discussed in comparison with stimulus-dependent synchronizations observed in neurophysiological experiments on the cortex.
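The abstract's central idea, that linked oscillating units fall into a coherent (synchronized) pattern while unlinked units do not, can be illustrated with a minimal sketch. This is not the paper's oscillator model: it swaps in Kuramoto-style phase oscillators as a stand-in, and the names `coherence` and `simulate`, the 39 to 41 Hz frequencies, the coupling strength of 20, and the initial phases are all assumptions chosen for illustration.

```python
import cmath
import math


def coherence(phases):
    """Kuramoto order parameter in [0, 1]; values near 1 mean the phases are aligned."""
    return abs(sum(cmath.exp(1j * p) for p in phases)) / len(phases)


def simulate(freqs, coupling, phases, dt=0.001, steps=2000):
    """Euler-integrate d(theta_i)/dt = w_i + sum_j K[i][j] * sin(theta_j - theta_i)."""
    n = len(freqs)
    phases = list(phases)
    for _ in range(steps):
        deriv = [
            freqs[i]
            + sum(coupling[i][j] * math.sin(phases[j] - phases[i]) for j in range(n))
            for i in range(n)
        ]
        phases = [p + dt * d for p, d in zip(phases, deriv)]
    return phases


# Hypothetical setup: three units with slightly different natural frequencies
# (39, 40, 41 Hz, assumed values) standing in for three formant-region groups.
freqs = [2 * math.pi * f for f in (39.0, 40.0, 41.0)]
start = [0.0, 1.0, 2.0]

# "Linked" case: every pair is coupled with an assumed strength of 20.
linked = [[0.0 if i == j else 20.0 for j in range(3)] for i in range(3)]
# "Unlinked" case: no coupling at all, so each unit oscillates independently.
unlinked = [[0.0] * 3 for _ in range(3)]

r_linked = coherence(simulate(freqs, linked, start))
r_unlinked = coherence(simulate(freqs, unlinked, start))

print("coherence with linking:   ", round(r_linked, 3))
print("coherence without linking:", round(r_unlinked, 3))
```

In this toy setting the order parameter plays a role loosely analogous to the evaluation of synchronized activity that the abstract attributes to the C-center; how the actual model measures coherence is not stated in the abstract.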
