Description of acoustic variations by hidden Markov models with tree structure

作者: Hsiao-Wuen Hon , Satoru Hayamizu , Kai-Fu Lee

DOI:

关键词:

摘要: This research was sponsored in part by U S WEST and the Defense Advanced Research Projects Agency (DOD), monitored Space Naval Warfare Systems Command under Contract N0003985-C-0163, ARPA Order No. 5167. The views conclusions contained this document are those of authors should not be interpreted as representing official policies, either expressed or implied, WEST, DARPA US government. K e y w o r d s : HMM(Hidden Markov Model), Binary-Tree Vector Quantization, Decision Tree Clustering, CART, Speaker Smoothing.

参考文章(13)
L.R. Bahl, R. Bakis, J. Bellegarda, P.F. Brown, D. Burshtein, S.K. Das, P.V. de Souza, P.S. Gopalakrishnan, F. Jelinek, D. Kanevsky, R.L. Mercer, A.J. Nadas, D. Nahamoo, M.A. Picheny, Large vocabulary natural language continuous speech recognition international conference on acoustics, speech, and signal processing. ,vol. 26, pp. 465- 467 ,(1989) , 10.1109/ICASSP.1989.266464
F. Jelinek, Interpolated estimation of Markov source parameters from sparse data Proc. Workshop on Pattern Recognition in Practice, 1980. pp. 381- 397 ,(1980)
K.-F. Lee, H.-W. Hon, M.-Y. Hwang, S. Mahajan, R. Reddy, The SPHINX speech recognition system international conference on acoustics, speech, and signal processing. pp. 445- 448 ,(1989) , 10.1109/ICASSP.1989.266459
R. Schwartz, Y. Chow, O. Kimball, S. Roucos, M. Krasner, J. Makhoul, Context-dependent modeling for acoustic-phonetic recognition of continuous speech international conference on acoustics, speech, and signal processing. ,vol. 10, pp. 1205- 1208 ,(1985) , 10.1109/ICASSP.1985.1168283
S. Sagayama, Phoneme environment clustering for speech recognition international conference on acoustics, speech, and signal processing. pp. 397- 400 ,(1989) , 10.1109/ICASSP.1989.266449
Hsiao-Wuen Hon, Kai-Fu Lee, Robert Weide, Towards speech recognition without vocabulary-specific training Proceedings of the workshop on Speech and Natural Language - HLT '89. pp. 271- 275 ,(1989) , 10.3115/1075434.1075479
K.-F. Lee, S. Hayamizu, H.-W. Hon, C. Huang, J. Swartz, R. Weide, Allophone clustering for continuous speech recognition international conference on acoustics, speech, and signal processing. pp. 749- 752 ,(1990) , 10.1109/ICASSP.1990.115900
H.-W. Hon, K.-F. Lee, On vocabulary-independent speech modeling international conference on acoustics, speech, and signal processing. pp. 725- 728 ,(1990) , 10.1109/ICASSP.1990.115887
R. Gray, Y. Linde, Vector Quantizers and Predictive Quantizers for Gauss-Markov Sources IEEE Transactions on Communications. ,vol. 30, pp. 381- 389 ,(1982) , 10.1109/TCOM.1982.1095471