Development of Acoustic and Linguistic Resources for Research and Evaluation in Interactive Vocal Information Servers

作者: Martin Rajman , Hervé Bourlard , Jean-Cédric Chappelier , Giulia Bernardis

DOI:

关键词:

摘要: This paper describes the setting up of a resource database for research and evaluation in domain interactive vocal information servers. All this development work took place project aiming at an advanced speech recognition system automatic processing telephone directory requests was performed on basis Swiss-French Polyphone (collected framework European SpeechDat project). Due to unavailability properly orthographically transcribed, consistently labeled tagged unconstrained (together with its associated lexicon) targeted area, we first concentrated annotation structuration spoken data order make it profitable lexical linguistic modeling results. A baseline then trained newly developed resources tested. Preliminary experiments showed relative improvement 46% Word Error Rate (WER) compared results previously obtained very similar but working unconsistent natural that originally available.

参考文章(6)
Martin Rajman, Hervé Bourlard, Jean-Cédric Chappelier, Giulia Bernardis, INtegrating SPEech acoustic and linguistic Constraints: Baseline System Development IDIAP. ,(1999)
Cédric Jaboulet, Philippe Langlais, Jean-Luc Cochard, Andrei Constantinescu, Gérard Chollet, Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability IDIAP. ,(1996)
Hervé Bourlard, R. Boite, T. Dutoit, H. Leich, J. Hancq, Traitement de la Parole Presses Polytechniques Universitaires Romandes. ,(2000)
Martin Rajman, Jean-Cédric Chappelier, A generalized CYK algorithm for parsing stochastic CFG Proc. of 1st Workshop on Tabulation in Parsing and Deduction (TAPD"98). pp. 133- 137 ,(1998)
H. Hermansky, N. Morgan, RASTA processing of speech IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 578- 589 ,(1994) , 10.1109/89.326616
Y. Gotoh, S. Renals, G. Williams, Named entity tagged language models international conference on acoustics speech and signal processing. ,vol. 1, pp. 513- 516 ,(1999) , 10.1109/ICASSP.1999.758175