Author: Hagai Aronowitz
DOI:
Keywords:
Abstract: A method and system for speaker diarization are provided. Pre-trained acoustic models of individual speakers and/or groups of speakers are obtained. Speech data with multiple speakers is received and divided into frames. For a frame, an acoustic feature vector is determined and extended to include log-likelihood ratios of the pre-trained models in relation to a background population model. The extended feature vector is used in segmentation and clustering algorithms.
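
The frame-extension step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the method as claimed: each pre-trained model and the background population model is stood in for by a single diagonal-covariance Gaussian (the record does not say which model family is used), and the names `log_likelihood`, `extend_frame`, `speakers`, and `background` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

# Assumption: a model is a single diagonal Gaussian given as (means, variances).
# A real system would likely use a richer acoustic model.
DiagGaussian = Tuple[Sequence[float], Sequence[float]]


def log_likelihood(frame: Sequence[float], model: DiagGaussian) -> float:
    """Log-likelihood of one frame under a diagonal-covariance Gaussian."""
    means, variances = model
    return sum(
        -0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)
        for x, mu, var in zip(frame, means, variances)
    )


def extend_frame(
    frame: Sequence[float],
    speaker_models: Sequence[DiagGaussian],
    background_model: DiagGaussian,
) -> List[float]:
    """Append one log-likelihood ratio per pre-trained model to the frame.

    Each ratio is log p(frame | model) - log p(frame | background).
    """
    background_ll = log_likelihood(frame, background_model)
    llrs = [log_likelihood(frame, m) - background_ll for m in speaker_models]
    return list(frame) + llrs


# Hypothetical toy models over 2-dimensional features.
background = ([0.0, 0.0], [4.0, 4.0])
speakers = [([1.0, 1.0], [1.0, 1.0]), ([-1.0, -1.0], [1.0, 1.0])]

# The frame sits at the first speaker's mean, so its first ratio is the larger one.
extended = extend_frame([1.0, 1.0], speakers, background)
```

The extended vectors, one per frame, would then be passed to whatever segmentation and clustering algorithms the system uses in place of the original acoustic features.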