作者: Hugo Meinedo , Thomas Pellegrini , Alberto Abad , Inesc-Id Lisboa , Isabel Trancoso
DOI:
关键词:
摘要: Abstract Broadcast news play an important role in our lives provid-ing access to news, information and entertainment. The ex-istence of automatic transcription is mediumthat not only can provide subtitles for inclusion people withspecial needs or be advantage on noisy populated envi-ronments, but also because it enables data search retrievecapabilities over the multimedia streams. In this work we willdescribe evaluate speech recognition systemsdeveloped two Iberian languages, European Portuguese andSpanish Brazilian Portuguese, African Portugueseand English. developed systems are fully andcapable subtitling real-time News stream with avery small delay.Index Terms: Speech Recognition, News, Iberianlanguages, Accent, Online processing 1. Introduction (BN) system at theSpoken Language Systems Lab INESC-ID integrates sev-eral core technologies, a pipeline architecture: jingle detec-tion, audio segmentation, recognition, punc-tuation, capitalization, topic segmentation/indexation, summa-rization, translation. first modules wereoptimized on-line performance, given their deployment inthe that isrunning main shows public TV channel inPortugal (RTP), since March 2008.To knowledge, majority de-scribed literature rely speech-to-text alignment ratherthan full [1]. Re-speakers alsoare commonly used simplify original speech, speechrecognition engines adapted captioner voice [2].This paper concerns third module -speech emphasizing most recent improvements,and efforts port other languages (English Span-ish), varieties namely those spokenin South American continents.The development new language chal-lenging task due need acoustic training data, vo-cabulary definition, lexicon generation model es-timation [3].The starts description ofour engine, indepen-dent components - feature extraction decoder. nextthree sections devoted three Portuguesecovered by system: one (European Portuguese,henceforth designated as EP), (BP), andAfrican (AP). porting twolanguages Spanish English) Sections 6 7, respectively. For each these sec-tions, shall detail corpora, vocabulary, lexical andlanguage generation, ending performance results.The final section discusses advantages shortcom-ings systems, what real time closecaptioning applications.