Integration of Complementary Acoustic Features for Speaker Recognition

作者: Nengheng Zheng , Tan Lee , P. C. Ching

DOI: 10.1109/LSP.2006.884031

关键词:

摘要: This letter describes a speaker verification system that uses complementary acoustic features derived from the vocal source excitation and tract system. A new feature set, named wavelet octave coefficients of residues (WOCOR), is proposed to capture spectro-temporal characteristics embedded in linear predictive residual signal. WOCOR used supplement conventional tract-related features, this case, Mel-frequency cepstral (MFCC), for verification. novel confidence measure-based score fusion technique applied integrate MFCC. Speaker experiments are carried out on NIST 2001 database. The equal error rate (EER) attained with method 7.67%, comparison 9.30% MFCC-based

参考文章(14)
Nengheng Zheng, P. C. Ching, Tan Lee, Time -frequency analysis of vocal source signal for speaker recognition. conference of the international speech communication association. ,(2004)
Mitchel Weintraub, Elizabeth Shriberg, Larry P. Heck, M. Kemal Sönmez, A lognormal tied mixture model of pitch for prosody based speaker recognition. conference of the international speech communication association. ,(1997)
Mark Ordowski, Mark A. Przybocki, Alvin F. Martin, George R. Doddington, Terri Kamm, The DET Curve in Assessment of Detection Task Performance conference of the international speech communication association. ,(1997)
Bojan Imperl, Zdravko Kačič, Bogomir Horvat, A study of harmonic features for the speaker recognition Speech Communication. ,vol. 22, pp. 385- 402 ,(1997) , 10.1016/S0167-6393(97)00053-8
Douglas A. Reynolds, Thomas F. Quatieri, Robert B. Dunn, Speaker Verification Using Adapted Gaussian Mixture Models Digital Signal Processing. ,vol. 10, pp. 19- 41 ,(2000) , 10.1006/DSPR.1999.0361
B. S. Atal, Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification The Journal of the Acoustical Society of America. ,vol. 55, pp. 1304- 1312 ,(1974) , 10.1121/1.1914702
Ingrid Daubechies, Ten Lectures on Wavelets ,(1992)
Lawrence R. Rabiner, Ronald W. Schafer, Digital Processing of Speech Signals ,(1978)
S. Furui, Cepstral analysis technique for automatic speaker verification IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 29, pp. 254- 272 ,(1981) , 10.1109/TASSP.1981.1163530