Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification

作者: Reda Dehak , Najim Dehak , Pierre Dumouchel , Pierre Ouellet , Niko Brummer

DOI:

关键词:

摘要: This paper presents a new speaker verification system architecture based on Joint Factor Analysis (JFA) as feature extractor. In this modeling, the JFA is used to define low-dimensional space named total variability factor space, instead of both channel and spaces for classical JFA. The main contribution in approach, use cosine kernel design two different systems: first Support Vector Machines based, second one uses directly decision score. last scoring method makes process faster less computation complex compared others methods. We tested several intersession compensation methods factors, we found that combination Linear Discriminate Within Class Covariance Normalization achieved best performance. remarkable results using fast only especially male trials, yield an EER 1.12% MinDCF 0.0094 English trials NIST 2008 SRE dataset. Index Terms: Total kernel, scoring, support vector machines.

参考文章(8)
Andreas Stolcke, Andrew O. Hatch, Sachin S. Kajarekar, Within-class covariance normalization for SVM-based speaker recognition. conference of the international speech communication association. ,(2006)
Douglas A. Reynolds, Thomas F. Quatieri, Robert B. Dunn, Speaker Verification Using Adapted Gaussian Mixture Models Digital Signal Processing. ,vol. 10, pp. 19- 41 ,(2000) , 10.1006/DSPR.1999.0361
Najim Dehak, Patrick Kenny, Reda Dehak, Ondrej Glembek, Pierre Dumouchel, Lukas Burget, Valiantsina Hubeika, Fabio Castaldo, Support vector machines and Joint Factor Analysis for speaker verification international conference on acoustics, speech, and signal processing. pp. 4237- 4240 ,(2009) , 10.1109/ICASSP.2009.4960564
P. Kenny, P. Ouellet, N. Dehak, V. Gupta, P. Dumouchel, A Study of Interspeaker Variability in Speaker Verification IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 16, pp. 980- 988 ,(2008) , 10.1109/TASL.2008.925147
W.M. Campbell, D.E. Sturim, D.A. Reynolds, A. Solomonoff, SVM Based Speaker Verification using a GMM Supervector Kernel and NAP Variability Compensation international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 97- 100 ,(2006) , 10.1109/ICASSP.2006.1659966
Pierre Ouellet, Najim Dehak, Patrick J Kenny, Réda Dehak, Pierre Dumouchel, Front-End Factor Analysis for Speaker Verification IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 19, pp. 788- 798 ,(2011) , 10.1109/TASL.2010.2064307
Ondrej Glembek, Lukas Burget, Najim Dehak, Niko Brummer, Patrick Kenny, Comparison of scoring methods used in speaker recognition with Joint Factor Analysis international conference on acoustics, speech, and signal processing. pp. 4057- 4060 ,(2009) , 10.1109/ICASSP.2009.4960519
Sridha Sridharan, Jason W. Pelecanos, Feature Warping for Robust Speaker Verification Proceedings of 2001 A Speaker Odyssey: The Speaker Recognition Workshop. pp. 213- 218 ,(2001)