Authors: Sebastian Ochs, Tim Schlippe, Tanja Schultz
DOI:
Keywords: Natural language processing, Speech synthesis, International Phonetic Alphabet, Word (computer architecture), Pronunciation, German, Computer science, Word error rate, Artificial intelligence
Abstract: In this paper, we analyze whether dictionaries from the World Wide Web that contain phonetic notations can support the rapid creation of pronunciation dictionaries within the speech recognition and speech synthesis system building process. As a representative dictionary, we selected Wiktionary [1], since it is available in multiple languages and, in addition to the definitions of words, provides many phonetic notations in terms of the International Phonetic Alphabet (IPA). Given word lists in four languages (English, French, German, and Spanish), we calculated the percentage of words with phonetic notations in Wiktionary. Furthermore, two quality checks were performed. First, we compared Wiktionary pronunciations with pronunciations from dictionaries based on the GlobalPhone project, which had been created in a rule-based fashion and manually cross-checked [2]. Second, we analyzed the impact of Wiktionary pronunciations on automatic speech recognition (ASR) systems. The French Wiktionary achieved the best coverage, containing phonetic notations for 92.58% of the French word list, as well as 76.12% and 30.16% for country and international city names. In our ASR system evaluation, the Spanish system gained the most improvement, with a 7.22% relative word error rate reduction.
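The two figures reported in the abstract, pronunciation coverage and relative word error rate (WER) reduction, are simple ratios. The following is a minimal sketch of how such values could be computed, not the authors' implementation. The function names, the dictionary layout (word mapped to a list of IPA strings), and the toy entries are all assumptions made for illustration.

```python
def coverage_percent(word_list, pronunciations):
    """Percentage of words in word_list with at least one phonetic notation.

    `pronunciations` is a hypothetical mapping: word -> list of IPA strings.
    """
    if not word_list:
        return 0.0
    covered = sum(1 for word in word_list if pronunciations.get(word))
    return 100.0 * covered / len(word_list)


def relative_wer_reduction(baseline_wer, new_wer):
    """Relative WER reduction in percent: (baseline - new) / baseline * 100."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


# Toy data for illustration only; not taken from the paper.
toy_pronunciations = {"Haus": ["haʊ̯s"], "Baum": ["baʊ̯m"], "leer": []}
toy_word_list = ["Haus", "Baum", "leer", "Xyz"]

toy_coverage = coverage_percent(toy_word_list, toy_pronunciations)
toy_reduction = relative_wer_reduction(20.0, 18.0)
```

In this sketch a word with an empty notation list counts as uncovered, so an entry that exists without any IPA notation does not inflate the coverage figure.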