Support vector machines and Joint Factor Analysis for speaker verification

Authors: Najim Dehak, Patrick Kenny, Reda Dehak, Ondrej Glembek, Pierre Dumouchel

DOI: 10.1109/ICASSP.2009.4960564

Abstract: This article presents several techniques for combining support vector machines (SVMs) with the Joint Factor Analysis (JFA) model for speaker verification. In this combination, the SVMs are applied to different sources of information produced by JFA: the Gaussian Mixture Model supervectors, and the speaker and common factors. We found that applying SVMs to the JFA factors gave the best results, especially when the within-class covariance normalization method is used to compensate for channel effects. The new combination gives results comparable to other classical scoring techniques.
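
As a rough illustration of the within-class covariance normalization (WCCN) step mentioned in the abstract, the Python sketch below estimates the average within-speaker covariance W of a set of factor vectors and builds a linear kernel of the form k(x1, x2) = x1^T W^{-1} x2. It is not the authors' implementation: the function names `wccn_projection` and `wccn_linear_kernel`, the small ridge term added to W, and the idea that the inputs are JFA speaker factors arranged one vector per row are all assumptions made for illustration.

```python
import numpy as np


def wccn_projection(factors, speaker_ids, ridge=1e-6):
    """Return a matrix B with B @ B.T == inv(W).

    W is the within-speaker covariance averaged over speakers.
    `factors` has shape (n_utterances, dim); `speaker_ids` has one label
    per row. The `ridge` term is an assumption added to keep W invertible.
    """
    factors = np.asarray(factors, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    speakers = np.unique(speaker_ids)
    dim = factors.shape[1]

    W = np.zeros((dim, dim))
    for s in speakers:
        x = factors[speaker_ids == s]
        centered = x - x.mean(axis=0)          # remove the speaker mean
        W += centered.T @ centered / len(x)    # per-speaker covariance
    W /= len(speakers)
    W += ridge * np.eye(dim)

    # Cholesky factor of inv(W): B is lower triangular, B @ B.T == inv(W)
    return np.linalg.cholesky(np.linalg.inv(W))


def wccn_linear_kernel(x1, x2, B):
    """Linear kernel in the WCCN-projected space: (B.T x1) . (B.T x2)."""
    return float((B.T @ x1) @ (B.T @ x2))
```

Under these assumptions, a Gram matrix built from `wccn_linear_kernel` could be handed to any SVM trainer that accepts a precomputed kernel; the projected vectors `factors @ B` have an average within-speaker covariance close to the identity.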
