Transcription of Multi-variety Portuguese Media Contents

Authors: Alberto Abad, Hugo Meinedo, Isabel Trancoso, João Neto

DOI: 10.1007/978-3-642-28885-2_46

Keywords: Variety (linguistics); Computer science; European Portuguese; Portuguese; Brazilian Portuguese; Natural language processing; Artificial intelligence; Transcription (software); Data search; Accent (sociolinguistics); Identification (information)

Abstract: Current automatic transcription technology applied to media contents is an important tool that not only allows generating subtitles, but also enables data search and retrieval capabilities over multimedia streams. Among others, one of the main challenges these systems have to deal with is speaker accent variability. In this work, we study the importance of this variability for three broad varieties of Portuguese: African Portuguese, Brazilian Portuguese, and European Portuguese. Then, we propose a multi-variety system based on the combination of variety identification followed by specific variety-dependent systems.
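The two-stage design described in the abstract (identify the variety first, then hand the audio to a variety-specific system) might be sketched as below. This is only an illustration under stated assumptions, not the authors' implementation: the function name `transcribe_multi_variety`, the variety codes `AP`/`BP`/`EP`, the `Audio` placeholder type, and the stub identifier and transcribers are all hypothetical and stand in for real trained models.

```python
from typing import Callable, Dict, Tuple

# Placeholder type (assumption): a real system would pass audio features or a waveform.
Audio = bytes


def transcribe_multi_variety(
    audio: Audio,
    identify_variety: Callable[[Audio], str],
    transcribers: Dict[str, Callable[[Audio], str]],
) -> Tuple[str, str]:
    """Route audio to a variety-dependent transcriber.

    Stage 1: a variety identifier labels the audio.
    Stage 2: the transcriber registered for that label produces the text.
    """
    variety = identify_variety(audio)
    if variety not in transcribers:
        raise ValueError(f"no transcriber registered for variety {variety!r}")
    return variety, transcribers[variety](audio)


# Hypothetical stand-ins for trained components, used only to exercise the routing.
def stub_identifier(audio: Audio) -> str:
    # Assumption: the demo "clips" carry their variety as a two-letter prefix.
    prefix = audio[:2].decode("ascii").upper()
    return prefix if prefix in ("AP", "BP", "EP") else "unknown"


stub_transcribers: Dict[str, Callable[[Audio], str]] = {
    "AP": lambda audio: "<African Portuguese transcript>",
    "BP": lambda audio: "<Brazilian Portuguese transcript>",
    "EP": lambda audio: "<European Portuguese transcript>",
}

if __name__ == "__main__":
    print(transcribe_multi_variety(b"bp-clip", stub_identifier, stub_transcribers))
```

Keeping the identifier and the transcribers as separate callables mirrors the abstract's description of a combination of components, so either stage can be swapped out without touching the other.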
