Segmentation, classification and clustering of an Italian broadcast news corpus

作者： Mauro Cettolo

DOI:

关键词: Segmentation 、 Computer science 、 Speech recognition 、 Scale-space segmentation 、 Bayesian information criterion 、 Pattern recognition 、 Word error rate 、 Automatic segmentation 、 Data stream 、 Cluster analysis 、 Artificial intelligence

摘要: This work reports on preliminary activity at ITC-irst the problem of acoustic segmentation, classification and clustering an Italian audio broadcast news corpus. The approach is based following stages. First, input data stream segmented by detecting spectral changes through Bayesian Information Criterion (BIC). Second, segments are classified in terms conditions, modeled mixtures Gaussians. Finally, from same speakers clustered, using again BIC. The scheme proposed for automatic causes a degradation recognition error rate, with respect to fully supervisioned experiment, equal 1.3% before adaptation, 3.4% after adaptation.

acm.org UNKNOWN 下载加速

参考文章(9)

Francis Kubala, Daben Liu, Fast speaker change detection for broadcast news transcription and indexing. conference of the international speech communication association. pp. 1031- 1034 ,(1999)

Reinhold Haeb-Umbach, Peter Beyerlein, Xavier L. Aubert, Matthew J. Harris, A study of broadcast news audio stream segmentation and segment clustering conference of the international speech communication association. ,(1999)

Ramesh A. Gopinath, Alain Tritschler, Improved speaker segmentation and segments clustering using the bayesian information criterion. conference of the international speech communication association. ,(1999)

S. Chen, Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion Proc. DARPA Broadcast News Transcription and Understanding Workshop, 1998. ,(1998)

F. Brugnara, M. Cettolo, M. Federico, D. Giuliani, A baseline for the transcription of Italian broadcast news international conference on acoustics, speech, and signal processing. ,vol. 3, pp. 1667- 1670 ,(2000) , 10.1109/ICASSP.2000.862070

M. Federico, M. Cettolo, F. Brugnara, D. Giuliani, A system for the segmentation and transcription of Italian radio news riao conference. pp. 364- 371 ,(2000)

A Tuerk, PC Woodland, SJ Young, T Hain, SE Johnson, Segment generation and clustering in the HTK broadcast news transcription system DARPA. ,(1998)

K. Kocherlakota, S. Kocherlakota, W. J. Krzanowski, Principles of multivariate analysis: a user's perspective Biometrics. ,vol. 45, pp. 1338- ,(1988) , 10.2307/2531791

Perrine Delacourt, Speaker-based segmentation for audio data indexing ISCA. ,(1999)

Segmentation, classification and clustering of an Italian broadcast news corpus

来源期刊

我的账户

Segmentation, classification and clustering of an Italian broadcast news corpus

来源期刊

相似文章 10

我的账户