A real-time trained system for robust speaker verification using relative space of anchor models

作者: Ali Sadeghi Naini , M. Mehdi Homayounpour , Abbas Samani

DOI: 10.1016/J.CSL.2009.07.002

关键词:

摘要: A real-time trained system for robust speaker verification is proposed. This was developed using a relative space of reference speakers, also referred to as anchor models. The training aspect the based on this space's intriguing features and properties. concept uses representation rather than an absolute representation, by comparing set well-trained speakers. advantage approach that instead estimating numerous parameters model speaker, only few number models are estimated. In order optimize performance proposed system, several techniques were assessed possible implementation in various blocks system. As result, best achieved where normalized vector's mutual angle with Minimum normalization method applied conjunction orthogonal virtual case, Equal Error Rate (EER) 0.12% 400 test samples 100 speakers obtained. addition assessment under normal conditions, evaluated abnormal conditions noisy or telephonic speech sequence contamination present. Experiments conducted case demonstrated that, most cases, outperforms systems even shortened sequences. Another major contribution research development more complex capable tackling effectively. other interesting employed. For purpose, novel enrichment construct tackle noise. results experiments part excellent ability conditions. Compared applying led lower error rates all cases low SNR values.

参考文章(25)
Benoit G. B. Fauve, Jean-François Bonastre, Driss Matrouf, Nicolas Scheffer, A Straightforward and Efficient Implementation of the Factor Analysis Model for Speaker Verification conference of the international speech communication association. pp. 1242- 1245 ,(2007)
Corinne Fredouille, Jean-François Bonastre, Teva Merlin, NON DIRECTLY ACOUSTIC PROCESS FOR COSTLESS SPEAKER RECOGNITION AND INDEXATION International Workshop on Intelligent Communication Technologies and Applications. ,(1999)
Yassine Mami, Delphine Charlet, Speaker identification by location in an optimal space of anchor models. conference of the international speech communication association. ,(2002)
Lawrence Rabiner, Biing-Hwang Juang, Fundamentals of speech recognition ,(1993)
Saeedeh Momtazi, Hossein Sameti, Saman Vaisipour, Meysam Tefagh, Introducing a Framework to Create Telephony Speech Databases from Direct Ones international conference on systems signals and image processing. pp. 327- 330 ,(2007) , 10.1109/IWSSIP.2007.4381108
Olivier Thyes, Jean-Claude Junqua, Roland Kuhn, Patrick Nguyen, Speaker identification and verification using eigenvoices. conference of the international speech communication association. pp. 242- 245 ,(2000)
Yassine Mami, Delphine Charlet, Speaker recognition by location in the space of reference speakers Speech Communication. ,vol. 48, pp. 127- 141 ,(2006) , 10.1016/J.SPECOM.2005.06.014
Engin Avci, Zuhtu Hakan Akpolat, Speech recognition using a wavelet packet adaptive network based fuzzy inference system Expert Systems With Applications. ,vol. 31, pp. 495- 503 ,(2006) , 10.1016/J.ESWA.2005.09.058