Neural networks and radial basis functions in classifying static speech patterns

Authors: Mahesan Niranjan, Frank Fallside

DOI: 10.1016/0885-2308(90)90009-U

Keywords: Computer science; Perceptron; Radial basis function network; Class (biology); Radial basis function; Pattern recognition; Artificial neural network; TIMIT database; Artificial intelligence; Simple (abstract algebra); Speech patterns; Speech recognition; Theoretical computer science; Human-Computer Interaction; Software

Abstract: This paper compares the performances of three non-linear pattern classifiers in the recognition of static speech patterns. Two of these are neural networks (the multi-layered perceptron and the modified Kanerva model). The third is the method of radial basis functions. A review of several classification techniques similar to radial basis functions is presented. The class boundaries generated by the different methods are compared on simple two-dimensional examples. Experiments on classifying eight vowels from a subset of the DARPA TIMIT database are reported.
