Inter-speaker variability in forensic voice comparison: A preliminary evaluation

Authors: Moez Ajili, Jean-François Bonastre, Solange Rossetto, Juliette Kahn

DOI: 10.1109/ICASSP.2016.7472050

Keywords: Normalization (statistics), Reliability (statistics), Speaker diarisation, Process (engineering), Bayesian paradigm, Speaker recognition, Computer science, Speech recognition

Abstract: In forensic voice comparison, it is strongly recommended to follow the Bayesian paradigm. In this paradigm, the strength of the evidence is summarized by a likelihood ratio (LR). The LR magnitude quantifies the strength of the evidence: far from unity for meaningful evidence (an LR which supports one hypothesis); close to unity when the evidence is next to useless. Despite this nice theoretical aspect, the LR does not embed the reliability of its estimation process itself. And, in various cases, the lack of reliability inside the estimation process is able to destroy the reliability of the resulting LR. This is particularly true when voice comparison is considered, as Speaker Recognition (SR) systems output a score in all situations regardless of the case-specific conditions. Furthermore, SR systems use different normalization steps to see their scores as LRs, and these normalization steps are clearly a potential source of bias. Consequently, a complete view of reliability should be taken into account in forensic voice comparison. This article focuses on a part of this question, the "speaker factor": the characteristics and behaviors of the two speakers involved in a voice comparison trial.

References (28)
Benoit G. B. Fauve, Jean-François Bonastre, Driss Matrouf, Nicolas Scheffer. A Straightforward and Efficient Implementation of the Factor Analysis Model for Speaker Verification. Conference of the International Speech Communication Association, pp. 1242-1245 (2007).
Olivier Galibert, Guillaume Gravier, Gilles Adda, Aude Giraudel, Niklas Paulsson, Matthieu Carré. The ETAPE corpus for the evaluation of speech-based TV content processing in the French language. Language Resources and Evaluation, pp. 114-118 (2012).
Yuri Matveev. The Problem of Voice Template Aging in Speaker Recognition Systems. International Conference on Speech and Computer, pp. 345-353 (2013). DOI: 10.1007/978-3-319-01931-4_46
Mark A. Przybocki, Douglas A. Reynolds, Alvin F. Martin, George R. Doddington, Walter Liggett. Sheep, Goats, Lambs and Wolves: A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation. Conference of the International Speech Communication Association (1998).
Joaquin Gonzalez-Rodriguez, Daniel Ramos. Forensic Automatic Speaker Classification in the Coming Paradigm Shift. Speaker Classification I, pp. 205-217 (2007). DOI: 10.1007/978-3-540-74200-5_11
Guillaume Gravier, Jean-François Bonastre, Khalid Choukri, Djamel Mostefa, Sylvain Galliano, Edouard Geoffrois. The ESTER Phase II Evaluation Campaign for the Rich Transcription of French Broadcast News. Conference of the International Speech Communication Association, pp. 1149-1152 (2005).
Geoffrey Stewart Morrison. Forensic voice comparison and the paradigm shift. Science & Justice, vol. 49, pp. 298-308 (2009). DOI: 10.1016/J.SCIJUS.2009.09.002
Niko Brümmer, Johan du Preez. Application-independent evaluation of speaker detection. Computer Speech & Language, vol. 20, pp. 230-275 (2006). DOI: 10.1016/J.CSL.2005.08.001
Phil Rose. Technical forensic speaker recognition: Evaluation, types and testing of evidence. Computer Speech & Language, vol. 20, pp. 159-191 (2006). DOI: 10.1016/J.CSL.2005.07.003