Best Wavelet Filter for a Wavelet Neural Fricatives Recognition System

作者: Dr. Ahmed Maamoon Alkababji

DOI: 10.33899/RENGJ.2011.26617

关键词:

摘要: Direct recognition of phonemes in speaker independent speech systems still cannot guarantee good enough results. But grouping at first then trying to recognize the phoneme itself is a promising field. On other hand wavelets are widely used and systems, this motivated by ability wavelet coefficients capture important time frequency features. In work effect filter type on efficiency system investigated (specifically fricatives). The Probabilistic neural network was as pattern matching stage for its well known power full solving classification problems. It found that Daubechies family (generally from db15 db23) candidate fricatives based feature extraction stage.

参考文章(20)
A.M.A. Ali, J. Van der Spiegel, P. Mueller, Acoustic-phonetic features for the automatic classification of stop consonants IEEE Transactions on Speech and Audio Processing. ,vol. 9, pp. 833- 841 ,(2001) , 10.1109/89.966086
Iosif Mporas, Todor Ganchev, Mihalis Siafarikas, Nikos Fakotakis, Comparison of Speech Features on the Speech Recognition Task Journal of Computer Science. ,vol. 3, pp. 608- 616 ,(2007) , 10.3844/JCSSP.2007.608.616
Mohamed El-Wakdy, Ehab El-Sehely, Mostafa El-Tokhy, Adel El-Hennawy, N Mastorakis, V Mladenov, Z Bojkovic, D Simian, S Kartalopoulos, A Varonides, Speech recognition using a wavelet transform to establish fuzzy inference system through subtractive clustering and neural network (ANFIS) international conference on systems. pp. 381- 386 ,(2008)
Børge Lindberg, Robert Modic, Bojan Petek, Comparative wavelet and MFCC speech recognition experiments on the Slovenian and English speechdat2. non-linear speech processing. pp. 16- ,(2003)
Mihalis Siafarikas, Nikos Fakotakis, Todor Ganchev, Wavelet packet based speaker verification. Odyssey. pp. 257- 264 ,(2004)
Nancy L. Dahlgren, Jonathan G. Fiscus, L F. Lamel, D S. Pallett, John S. Garofolo, W M. Fisher, Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST NIST Interagency/Internal Report (NISTIR) - 4930. ,(1993)
Kjell Elenius, Hans G. C. Tråvén, Multi-layer perceptrons and probabilistic neural networks for phoneme recognition. conference of the international speech communication association. ,(1993)
Lawrence Rabiner, Biing-Hwang Juang, Fundamentals of speech recognition ,(1993)