作者: L. Besacier , J.F. Bonastre
DOI: 10.1109/ICASSP.1998.675377
关键词:
摘要: In this paper, we propose a frame selection procedure for text-independent speaker identification. Instead of averaging the likelihoods along whole test utterance, some these are rejected (pruning) and final score is computed with limited number frames. This pruning stage requires prior level likelihood normalization in order to make comparison between frames meaningful. alone leads significant performance enhancement. As far as concerned, optimal pruned learned on tuning data set normal telephone speech. Validation 567 speakers 27% identification rate improvement TIMIT, 17% NTIMIT.