Authors: Yoshua Bengio, Renato De Mori, Piero Cosi
DOI:
Keywords:
Abstract: The paper describes a speech coding system based on an ear model followed by a set of Multi-Layer Networks (MLNs). The MLNs are trained to learn how to recognize articulatory features such as the place and manner of articulation. Experiments are performed on 10 English vowels, showing a recognition rate higher than 95% on new speakers. When the features are used for recognition, comparable results are obtained for vowels and diphthongs not used for training and pronounced by new speakers. This suggests that MLNs suitably fed by the data computed by an ear model have good generalization capabilities over new speakers and new sounds.