作者: X. Huang
DOI: 10.1109/ICASSP.1992.225871
关键词: Computer science 、 Normalization (statistics) 、 Speech recognition 、 Speech synthesis 、 Speaker recognition 、 Nonlinear system 、 Artificial intelligence 、 Word error rate 、 Loudspeaker 、 Artificial neural network 、 Training set 、 Pattern recognition
摘要: A codeword-dependent neural network (CDNN) is presented for the study of speaker adaptation. The CDNN used as a nonlinear mapping function to transform speech data between two speakers. characterized by number important properties. First, assembly functions enhances overall quality. Second, multiple input vectors are simultaneously in transformation. This not only makes full use dynamic information but also alleviates possible errors supervision data. Finally, derived from training data, with quality dependent on available amount Based speaker-dependent models, performance evaluation showed that normalization significantly reduced error rate 41.9% 5.0%. >