作者: Alberto Abad , Hugo Meinedo , Isabel Trancoso , João Neto
DOI: 10.1007/978-3-642-28885-2_46
关键词: Variety (linguistics) 、 Computer science 、 European Portuguese 、 Portuguese 、 Brazilian Portuguese 、 Natural language processing 、 Artificial intelligence 、 Transcription (software) 、 Data search 、 Accent (sociolinguistics) 、 Identification (information)
摘要: Current automatic transcription technology applied to media contents is an important medium that not only allows generating subtitles, but also enables data search and retrieval capabilities over multimedia streams. Among others, one of the most challenges systems have deal with speaker accent variability. In this work we study importance variability for three broad varieties Portuguese: African Portuguese, Brazilian Portuguese European Portuguese. Then, propose a multi-variety system based on combination variety identification followed by specific variety-dependent systems.