Data-driven clustering for blind feature mapping in speaker verification.

Authors: Michael W. Mason, Brendan J. Baker, Robert J. Vogt, Sridha Sridharan

DOI:

Keywords: Speaker diarisation, Computer science, Pattern recognition, NIST, Context model, Data-driven, Channel (digital image), Cluster analysis, Artificial intelligence, Speaker recognition

Abstract: Handset and channel mismatch significantly degrades the performance of automatic speaker recognition systems. This paper enhances the feature mapping technique by proposing an iterative clustering approach to context model generation, which offers an improvement over feature mapping trained on labelled data and the potential to train feature mapping in the absence of correctly labelled background data. The performance of the clustered models is demonstrated on an expanded version of the NIST 2003 Extended Data Task (EDT) protocol.

References (9)
Subramanian Sridharan, Robert Vogt, Jason Pelecanos. A Study on Standard and Iterative MAP Adaptation for Speaker Recognition. Proceedings of the 9th Australian International Conference on Speech Science and Technology (2002).
Jan Kane, Gavin Kelly, Tony Mansfield, David Chandler. Biometric Product Testing Final Report (2001).
Ben Shahshahani, Remco Teunen, Larry P. Heck. A model-based transformational approach to robust speaker recognition. Conference of the International Speech Communication Association, pp. 495-498 (2000).
Roland Auckenthaler, Michael Carey, Harvey Lloyd-Thomas. Score Normalization for Text-Independent Speaker Verification Systems. Digital Signal Processing, vol. 10, pp. 42-54 (2000). DOI: 10.1006/DSPR.1999.0360
D.A. Reynolds. Channel robust speaker verification via feature mapping. International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 53-56 (2003). DOI: 10.1109/ICASSP.2003.1202292
H. Hermansky, N. Morgan. RASTA processing of speech. IEEE Transactions on Speech and Audio Processing, vol. 2, pp. 578-589 (1994). DOI: 10.1109/89.326616
S. Davis, P. Mermelstein. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, pp. 65-74 (1980). DOI: 10.1109/TASSP.1980.1163420
Sridha Sridharan, Jason W. Pelecanos. Feature Warping for Robust Speaker Verification. Proceedings of 2001 A Speaker Odyssey: The Speaker Recognition Workshop, pp. 213-218 (2001).
A. K. Samingan, Bernie Mulgrew, L. Hanzo, S. Chen. IEEE International Conference on Acoustics Speech and Signal Processing (2001).