A Novel Speaker Binary Key Derived from Anchor Models

作者: Xavier Anguera , Jean-François Bonastre

DOI:

关键词:

摘要: The approach presented in this paper represents voice recordings by a novel acoustic key composed only of binary values. Except for the process being used to extract such keys, there is no need modeling and processing proposed, as all other elements system are based on vectors. We show that able effectively model speaker’s distinguish it from speakers. Its main properties its small size compared current speaker techniques low computational cost when comparing different speakers limited obtaining similarity metric between two Furthermore, vector extraction does not any threshold offers opportunity set decision steps well defined domain where scores decisions easy interpret implement. Index Terms: key, modeling, biometrics

参考文章(10)
Corinne Fredouille, Jean-François Bonastre, Teva Merlin, NON DIRECTLY ACOUSTIC PROCESS FOR COSTLESS SPEAKER RECOGNITION AND INDEXATION International Workshop on Intelligent Communication Technologies and Applications. ,(1999)
Yassine Mami, Delphine Charlet, Speaker identification by location in an optimal space of anchor models. conference of the international speech communication association. ,(2002)
Matthew A Siegler, Uday Jain, Bhiksha Raj, Richard M Stern, Automatic Segmentation, Classification and Clustering of Broadcast News Audio DARPA Speech Recognition Workshop, 1997. pp. 97- 99 ,(1997)
Leandro Rodrı́guez-Liñares, Carmen Garcı́a-Mateo, José Luis Alba-Castro, On combining classifiers for speaker authentication Pattern Recognition. ,vol. 36, pp. 347- 359 ,(2003) , 10.1016/S0031-3203(02)00035-3
Niko Brümmer, Johan du Preez, Application-independent evaluation of speaker detection Computer Speech & Language. ,vol. 20, pp. 230- 275 ,(2006) , 10.1016/J.CSL.2005.08.001
Douglas A. Reynolds, Thomas F. Quatieri, Robert B. Dunn, Speaker Verification Using Adapted Gaussian Mixture Models Digital Signal Processing. ,vol. 10, pp. 19- 41 ,(2000) , 10.1006/DSPR.1999.0361
P. Kenny, G. Boulianne, P. Dumouchel, Eigenvoice modeling with sparse training data IEEE Transactions on Speech and Audio Processing. ,vol. 13, pp. 345- 354 ,(2005) , 10.1109/TSA.2004.840940
Frédéric Bimbot, Jean-François Bonastre, Corinne Fredouille, Guillaume Gravier, Ivan Magrin-Chagnolleau, Sylvain Meignier, Teva Merlin, Javier Ortega-García, Dijana Petrovska-Delacrétaz, Douglas A Reynolds, A Tutorial on Text-Independent Speaker Verification EURASIP Journal on Advances in Signal Processing. ,vol. 2004, pp. 430- 451 ,(2004) , 10.1155/S1110865704310024
W.M. Campbell, D.E. Sturim, D.A. Reynolds, A. Solomonoff, SVM Based Speaker Verification using a GMM Supervector Kernel and NAP Variability Compensation international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 97- 100 ,(2006) , 10.1109/ICASSP.2006.1659966
Niko Brummer, Lukas Burget, Jan Cernocky, Ondrej Glembek, Frantisek Grezl, Martin Karafiat, David A. van Leeuwen, Pavel Matejka, Petr Schwarz, Albert Strasheim, Fusion of Heterogeneous Speaker Recognition Systems in the STBU Submission for the NIST Speaker Recognition Evaluation 2006 IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 15, pp. 2072- 2084 ,(2007) , 10.1109/TASL.2007.902870