Bayesian bpproach based decision in speaker verification.

作者: Corinne Fredouille , Jean-François Bonastre , Téva Merlin

DOI:

关键词: Statistical hypothesis testingSpeaker verificationSpeaker recognitionBounded functionComputer scienceNormalization (statistics)Machine learningPattern recognitionSpeaker diarisationBayesian probabilityProbabilistic logicArtificial intelligence

摘要: Considering Bayesian decision framework applied in the context of speaker verification, this paper presents a new way handling troublesome anti-speaker model by proposing redefinition hypotheses involved classical statistical hypothesis test. This definition is then implemented through independent normalization technique, named MAP approach. Besides supporting these hypotheses, approach takes advantages projecting likelihood scores into probabilistic domain and therefore providing threshold with bounded meaningful values. In paper, different variants are presented which mainly aims at reducing variability, well-known verification to degrade system performance. firstly combined techniques (likelihood ratio (world model) and/or Hnorm technique). The second kind consists redesigning become dependent. Experiments conducted on subset Switchboard database involving have showed that able perform as well while yielding suitable for setting or fusion recognizer multi-recognizer architecture.

参考文章(5)
Corinne Fredouille, Jean-François Bonastre, Téva Merlin, Similarity normalization method based on world model and a posteriori probability for speaker verification. conference of the international speech communication association. ,(1999)
I. Magrin-Chagnolleau, Corinne Fredouille, Dominique Genoud, Guillaume Gravier, Frédéric Bimbot, G. Durou, Jamal Kharroubi, Jean Hennebert, Jean-François Bonastre, S. Pigeon, Patrick Verlinde, Gilles Caloz, Chafic Mokbel, Raphaël Blouet, T. Merlin, M. Seck, M. Zouhal, Gérard Chollet, J. Cernocky, B. Nedic, D. Petrovska-Delacretaz, The ELISA Systems for the NIST"99 Evaluation in Speaker Detection and Tracking DSP Journal (Special Issue on the Nist Speaker Recognition Workshop). ,(1999)
Mark Ordowski, Mark A. Przybocki, Alvin F. Martin, George R. Doddington, Terri Kamm, The DET Curve in Assessment of Detection Task Performance conference of the international speech communication association. ,(1997)
Roland Auckenthaler, Michael Carey, Harvey Lloyd-Thomas, Score Normalization for Text-Independent Speaker Verification Systems Digital Signal Processing. ,vol. 10, pp. 42- 54 ,(2000) , 10.1006/DSPR.1999.0360
D.A. Reynolds, The effects of handset variability on speaker recognition performance: experiments on the Switchboard corpus international conference on acoustics speech and signal processing. ,vol. 1, pp. 113- 116 ,(1996) , 10.1109/ICASSP.1996.540303