STBU System for the NIST 2006 Speaker Recognition Evaluation

作者: Pavel Matejka , Lukás Burget , Petr Schwarz , Ondrej Glembek , Martin Karafiat

DOI: 10.1109/ICASSP.2007.367203

关键词:

摘要: This paper describes STBU 2006 speaker recognition system, which performed well in the NIST evaluation. is consortium of 4 partners: Spescom DataVoice (South Africa), TNO (Netherlands), BUT (Czech Republic) and University Stellenbosch Africa). The primary system a combination three main kinds systems: (1) GMM, with short-time MFCC or PLP features, (2) GMM-SVM, using GMM mean supervectors as input (3) MLLR-SVM, MLLR adaptation coefficients derived from English LVCSR system. In this paper, we describe these sub-systems present results for each alone on Speaker Recognition Evaluation (SRE) development evaluation data sets.

参考文章(12)
Douglas A. Reynolds, A Gaussian mixture modeling approach to text-independent speaker identification Georgia Institute of Technology. ,(1992)
Andreas Stolcke, Elizabeth Shriberg, Luciana Ferrer, Anand Venkataraman, Sachin S. Kajarekar, MLLR transforms as features in speaker recognition. conference of the international speech communication association. pp. 2425- 2428 ,(2005)
Niko Brümmer, Johan du Preez, Application-independent evaluation of speaker detection Computer Speech & Language. ,vol. 20, pp. 230- 275 ,(2006) , 10.1016/J.CSL.2005.08.001
Lukas Burget, Pavel Matejka, Petr Schwarz, Ondrej Glembek, Jan Honza Cernocky, Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 15, pp. 1979- 1986 ,(2007) , 10.1109/TASL.2007.902499
Michael W. Mason, Brendan J. Baker, Robert J. Vogt, Sridha Sridharan, Data-driven clustering for blind feature mapping in speaker verification. conference of the international speech communication association. pp. 3109- 3112 ,(2005)
D.A. Reynolds, Channel robust speaker verification via feature mapping international conference on acoustics, speech, and signal processing. ,vol. 2, pp. 53- 56 ,(2003) , 10.1109/ICASSP.2003.1202292
P. Schwarz, P. Matejka, J. Cernocky, Hierarchical Structures of Neural Networks for Phoneme Recognition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 325- 328 ,(2006) , 10.1109/ICASSP.2006.1660023
W.M. Campbell, D.E Sturim, D.A. Reynolds, Support vector machines using GMM supervectors for speaker verification IEEE Signal Processing Letters. ,vol. 13, pp. 308- 311 ,(2006) , 10.1109/LSP.2006.870086
A. Solomonoff, W.M. Campbell, I. Boardman, Advances in channel compensation for SVM speaker recognition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 629- 632 ,(2005) , 10.1109/ICASSP.2005.1415192
Chih-Chung Chang, Chih-Jen Lin, LIBSVM ACM Transactions on Intelligent Systems and Technology. ,vol. 2, pp. 1- 27 ,(2011) , 10.1145/1961189.1961199