Within-session variability modelling for factor analysis speaker verification

作者: Robbie Vogt , Sridha Sridharan , Jason W. Pelecanos , Sachin S. Kajarekar , Nicolas Scheffer

DOI:

关键词:

摘要: This work presents an extended Joint Factor Analysis model including explicit modelling of unwanted within-session variability. The goals the proposed JFA are to improve verification performance with short utterances by compensating for effects limited or imbalanced phonetic coverage, and produce a flexible that is effective over wide range utterance lengths without adjusting parameters such as retraining session subspaces. Experimental results on 2006 NIST SRE corpus demonstrate flexibility providing competitive also yielding modest improvements in number conditions current state-of-the-art.

参考文章(8)
Christopher Lustri, Robert Vogt, Sridha Sridharan, Factor analysis modelling for speaker verification with short utterances Proceedings of Odyssey 2008: The Speaker and Language Recognition Workshop. pp. 19- ,(2008)
Petr Schwarz, Pavel Matějka, Jan Černocký, Towards Lower Error Rates in Phoneme Recognition text speech and dialogue. pp. 465- 472 ,(2004) , 10.1007/978-3-540-30120-2_59
Lukas Burget, Pavel Matejka, Petr Schwarz, Ondrej Glembek, Jan Honza Cernocky, Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 15, pp. 1979- 1986 ,(2007) , 10.1109/TASL.2007.902499
Nicolas Scheffer, Robbie Vogt, Sachin Kajarekar, Jason Pelecanos, Combination strategies for a factor analysis phone-conditioned speaker verification system international conference on acoustics, speech, and signal processing. pp. 4053- 4056 ,(2009) , 10.1109/ICASSP.2009.4960518
Najim Dehak, Pierre Dumouchel, Patrick Kenny, Modeling Prosodic Features With Joint Factor Analysis for Speaker Verification IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 15, pp. 2095- 2103 ,(2007) , 10.1109/TASL.2007.902758
Robbie Vogt, Sridha Sridharan, Explicit modelling of session variability for speaker verification Computer Speech & Language. ,vol. 22, pp. 17- 38 ,(2008) , 10.1016/J.CSL.2007.05.003
Pierre Dumouchel, Patrick Kenny, Experiments in speaker verification using factor analysis likelihood ratios Odyssey. pp. 219- 226 ,(2004)
Brendan Baker, Robbie Vogt, Sridha Sridharan, Factor analysis subspace estimation for speaker verification with short utterances conference of the international speech communication association. pp. 853- 856 ,(2008)