作者: M. Nishimura , K. Toshioka
DOI: 10.1109/ICASSP.1987.1169883
关键词: Frame (networking) 、 Multi dimensional 、 Mathematics 、 Sequence labeling 、 Speech recognition 、 Vector quantization 、 Word recognition 、 Word error rate 、 Task (computing) 、 Hidden Markov model 、 Artificial intelligence 、 Pattern recognition
摘要: This paper describes a new vector quantization (VQ; so-called labeling) method of speech recognition system based on hidden Markov model (HMM). For improving the VQ accuracy in simple manner, "multi-labeling" which generates multiple labels at each frame was introduced while keeping conventional HMM formulation. Furthermore, order to represent characteristics accurately and effectively, "multi-dimensional labeling" also quantizes features such as spectral dynamics spectrum independently. labeling tested an isolated word task using 150 Japanese confusable words. The error rate roughly reduced 1/2 or less compared with method.