作者: A. Nagai , K. Yamaguchi , S. Sagayama , A. Kurematsu
DOI: 10.1109/ICASSP.1993.319251
关键词: Hidden Markov model 、 Artificial neural network 、 Computer science 、 Fuzzy logic 、 Cepstrum 、 Vector quantization 、 Pattern recognition 、 Speech recognition 、 Phrase 、 Artificial intelligence 、 Codebook 、 Speech synthesis 、 Phone
摘要: The authors describe ATREUS, an aggregation of a large variety continuous speech recognition systems, forming the spoken input front-end interpreting telephony system. ATREUS includes following phone models: discrete HMMs (hidden Markov models) with fuzzy vector quantization (VQ) and multiple codebooks; mixture density HMMs; hidden networks derived from SSS (successive state splitting) algorithm; time-delay-neural networks; partition models. Its speaker modes involve speaker-dependent, speaker-independent, speaker-adaptive techniques such as codebook mapping for VQ-HMMs, field smoothing all types HMMs, neural network mapping. A comparative study is given viewpoints structure, constituent techniques, hardware implementation, performance. was evaluated Japanese phrase recognition. combination called ATREUS/SSS-LR had best performance among systems. >