Transcription automatique de langues peu dotées

Author: Thomas Pellegrini

Abstract: Speech technologies, and speech recognition in particular, are attracting strong interest for a growing number of languages. The vast majority of the world's languages lack the large data corpora needed to build state-of-the-art recognition systems, most of which are based on probabilistic paradigms. The work carried out in this thesis first consisted of identifying the difficulties encountered when building a system for an under-resourced language. We worked mainly on the problem of high out-of-vocabulary (OOV) word rates caused by the lack of text, which in our view is the most important problem for these languages. We argue that using properly selected sub-word lexical units, which may be smaller than words, can bring significant performance gains. We used and modified a probabilistic algorithm that proposes morph boundaries, introducing properties that characterize possible acoustic-phonetic confusion between recognition units. Experiments were conducted on two different languages, Amharic and Turkish, in collaboration with a team of Turkish researchers from Bogazici University in Istanbul. They yielded modest but significant gains of around 5% relative, with relative OOV reductions of between 30% and 50% for the languages studied.
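To make the out-of-vocabulary (OOV) figures concrete, here is a minimal sketch of how an OOV rate and its relative reduction can be computed when words are replaced by sub-word units. It is not the thesis's algorithm: the word lists, the `splits` table, and the function names `oov_rate`, `relative_reduction`, and `segment` are all hypothetical, and the hand-written splits merely stand in for the output of a morph-segmentation step.

```python
def oov_rate(tokens, vocabulary):
    """Fraction of running tokens that are absent from the vocabulary."""
    if not tokens:
        return 0.0
    missing = sum(1 for tok in tokens if tok not in vocabulary)
    return missing / len(tokens)


def relative_reduction(baseline, new):
    """Relative reduction of `new` with respect to `baseline` (0.4 means 40%)."""
    if baseline == 0:
        return 0.0
    return (baseline - new) / baseline


def segment(word, splits):
    """Return the sub-word units of a word, or the word itself if no split is known."""
    return splits.get(word, [word])


# Toy, made-up data (not taken from the thesis).
train_words = ["talk", "talked", "walk", "walking"]
test_words = ["talking", "walked", "walk", "talk", "jumped"]

# Hypothetical morph-like splits, standing in for a segmentation algorithm's output.
splits = {
    "talked": ["talk", "+ed"],
    "talking": ["talk", "+ing"],
    "walked": ["walk", "+ed"],
    "walking": ["walk", "+ing"],
    "jumped": ["jump", "+ed"],
}

# Vocabulary and test stream at the word level and at the sub-word level.
word_vocab = set(train_words)
unit_vocab = {unit for word in train_words for unit in segment(word, splits)}
test_units = [unit for word in test_words for unit in segment(word, splits)]

word_oov = oov_rate(test_words, word_vocab)
unit_oov = oov_rate(test_units, unit_vocab)

print(f"word-level OOV rate: {word_oov:.3f}")
print(f"unit-level OOV rate: {unit_oov:.3f}")
print(f"relative OOV reduction: {relative_reduction(word_oov, unit_oov):.3f}")
```

In this toy setting, unseen inflected forms decompose into units already observed in training, which is the intuition behind using sub-word units when training text is scarce. The size of the effect here reflects only the made-up data.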
