AUDIMUS.MEDIA: A Broadcast News Speech Recognition System for the European Portuguese Language

作者: Hugo Meinedo , Diamantino Caseiro , João Neto , Isabel Trancoso

DOI: 10.1007/3-540-45011-4_2

关键词:

摘要: Many applications such as media monitoring are experiencing a large expansion consequence of the different emerging sources and can benefit dramatically by using automatic transcription audio data. In this paper, we describe development speech recognition engine, AUDIMUS.MEDIA used in Broadcast News domain. Additionally recent improvements that permitted relative error decrease more than 20% 4x speed-up.

参考文章(9)
Diamantino Caseiro, Isabel Trancoso, On integrating the lexicon with the language model. conference of the international speech communication association. pp. 2131- 2134 ,(2001)
Diamantino Caseiro, Isabel Trancoso, Using dynamic WFST composition for recognizing broadcast news. conference of the international speech communication association. ,(2002)
Hugo Meinedo, João Paulo Neto, Combination of acoustic models in continuous speech recognition hybrid systems. conference of the international speech communication association. pp. 931- 934 ,(2000)
Hugo Meinedo, Nuno Souto, Rui Amaral, João Paulo Neto, Thibault Langlois, Isabel Trancoso, The development of a portuguese version of a media watch system. conference of the international speech communication association. pp. 2689- 2692 ,(2001)
D. Caseiro, F.M. Silva, Rua Alves Redol, C. Viana, AUTOMATIC ALIGNMENT OF MAP TASK DIALOGS USING WFSTS ,(2000)
S. Renals, M. Hochberg, Efficient search using posterior phone probability estimates international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 596- 599 ,(1995) , 10.1109/ICASSP.1995.479668
Mehryar Mohri, Fernando Pereira, Michael Riley, Weighted finite-state transducers in speech recognition Computer Speech & Language. ,vol. 16, pp. 69- 88 ,(2002) , 10.1006/CSLA.2001.0184
D. Caseiro, I. Trancoso, Transducer composition for "on-the-fly" lexicon and language model integration ieee automatic speech recognition and understanding workshop. pp. 393- 396 ,(2001) , 10.1109/ASRU.2001.1034667
M. Mohri, Fernando Pereira, Michael Riley, Weighted finite state transducers in speech recognition Proceedings of the Automatic Speech Recognition Workshop, Paris, France, 2000. ,(2000)