Investigating recognition of children's speech

作者: D. Giuliani , M. Gerosa

DOI: 10.1109/ICASSP.2003.1202313

关键词:

摘要: Recognition of children's speech was investigated by considering a phone recognition task. Two baseline systems were trained, one for children and adults, exploiting two Italian databases. Under matching conditions, training performed with data from the same population group, accuracy 77.30% 79.43% respectively. It found that, many children, results as good adults. However, higher variability in across speakers observed than Vocal tract length normalization, under matched mismatched testing also investigated. For both adults performance improvement, respect to systems, observed.

参考文章(10)
Alexandros Potamianos, Sungbok Lee, Shrikanth S. Narayanan, Automatic speech recognition for children. conference of the international speech communication association. ,(1997)
Fabio Brugnara, Maurizio Omologo, Daniele Falavigna, Roberto Gretter, Diego Giuliani, Bianca Angelini, Speaker independent continuous speech recognition using an acoustic-phonetic Italian corpus. conference of the international speech communication association. ,(1994)
Martin J. Russell, Qun Li, An analysis of the causes of increased error rates in children²s speech recognition. conference of the international speech communication association. ,(2002)
Sudha Arunachalam, Elaine Andersen, Shrikanth S. Narayanan, Dani Byrd, Dylan Gould, Politeness and frustration language in child-machine interactions conference of the international speech communication association. pp. 2675- 2678 ,(2001)
Sungbok Lee, Alexandros Potamianos, Shrikanth Narayanan, Acoustics of children's speech: developmental changes of temporal and spectral parameters. Journal of the Acoustical Society of America. ,vol. 105, pp. 1455- 1468 ,(1999) , 10.1121/1.426686
Li Lee, R.C. Rose, Speaker normalization using efficient frequency warping procedures international conference on acoustics speech and signal processing. ,vol. 1, pp. 353- 356 ,(1996) , 10.1109/ICASSP.1996.541105
J.G. Wilpon, C.N. Jacobsen, A study of speech recognition for children and the elderly international conference on acoustics speech and signal processing. ,vol. 1, pp. 349- 352 ,(1996) , 10.1109/ICASSP.1996.541104
L. Welling, S. Kanthak, H. Ney, Improved methods for vocal tract normalization international conference on acoustics speech and signal processing. ,vol. 2, pp. 761- 764 ,(1999) , 10.1109/ICASSP.1999.759780
S. Das, D. Nix, M. Picheny, Improvements in children's speech recognition performance international conference on acoustics speech and signal processing. ,vol. 1, pp. 433- 436 ,(1998) , 10.1109/ICASSP.1998.674460
S. Narayanan, A. Potamianos, Creating conversational interfaces for children IEEE Transactions on Speech and Audio Processing. ,vol. 10, pp. 65- 78 ,(2002) , 10.1109/89.985544