Segmentation, classification and clustering of an Italian broadcast news corpus

作者: Mauro Cettolo

DOI:

关键词: SegmentationComputer scienceSpeech recognitionScale-space segmentationBayesian information criterionPattern recognitionWord error rateAutomatic segmentationData streamCluster analysisArtificial intelligence

摘要: This work reports on preliminary activity at ITC-irst the problem of acoustic segmentation, classification and clustering an Italian audio broadcast news corpus. The approach is based following stages. First, input data stream segmented by detecting spectral changes through Bayesian Information Criterion (BIC). Second, segments are classified in terms conditions, modeled mixtures Gaussians. Finally, from same speakers clustered, using again BIC. The scheme proposed for automatic causes a degradation recognition error rate, with respect to fully supervisioned experiment, equal 1.3% before adaptation, 3.4% after adaptation.

参考文章(9)
Francis Kubala, Daben Liu, Fast speaker change detection for broadcast news transcription and indexing. conference of the international speech communication association. pp. 1031- 1034 ,(1999)
Reinhold Haeb-Umbach, Peter Beyerlein, Xavier L. Aubert, Matthew J. Harris, A study of broadcast news audio stream segmentation and segment clustering conference of the international speech communication association. ,(1999)
Ramesh A. Gopinath, Alain Tritschler, Improved speaker segmentation and segments clustering using the bayesian information criterion. conference of the international speech communication association. ,(1999)
S. Chen, Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion Proc. DARPA Broadcast News Transcription and Understanding Workshop, 1998. ,(1998)
F. Brugnara, M. Cettolo, M. Federico, D. Giuliani, A baseline for the transcription of Italian broadcast news international conference on acoustics, speech, and signal processing. ,vol. 3, pp. 1667- 1670 ,(2000) , 10.1109/ICASSP.2000.862070
M. Federico, M. Cettolo, F. Brugnara, D. Giuliani, A system for the segmentation and transcription of Italian radio news riao conference. pp. 364- 371 ,(2000)
A Tuerk, PC Woodland, SJ Young, T Hain, SE Johnson, Segment generation and clustering in the HTK broadcast news transcription system DARPA. ,(1998)
K. Kocherlakota, S. Kocherlakota, W. J. Krzanowski, Principles of multivariate analysis: a user's perspective Biometrics. ,vol. 45, pp. 1338- ,(1988) , 10.2307/2531791