Speech processing : a dynamic and optimization-oriented approach

作者: Li Deng , Douglas O'Shaughnessy

DOI: 10.1201/9781482276237

关键词: Speech scienceSpeech codingSpeech enhancementSpeech processingAcoustic modelSpeech synthesisSpeech technologyArtificial intelligenceComputer scienceSpeech recognitionNatural language processingSpeech production

摘要: Analytical background and techniques: discrete-time signals, systems transforms analysis of speech signals probability random processes linear model dynamic system optimization methods estimation theory statistical pattern recognition. Fundamentals science: phonetic process phonological process. Computational phonology phonetics: computational models for production auditory speechprocessing. Speech technology in selected areas: recognition enhancement synthesis.

参考文章(287)
Harlan Lane, Jane Wozniak Webster, Speech deterioration in postlingually deafened adults Journal of the Acoustical Society of America. ,vol. 89, pp. 859- 866 ,(1991) , 10.1121/1.1894647
J. Picone, S. Pike, R. Regan, T. Kamm, J. Bridle, L. Deng, Z. Ma, H. Richards, M. Schuster, Initial evaluation of hidden dynamic models on conversational speech international conference on acoustics speech and signal processing. ,vol. 1, pp. 109- 112 ,(1999) , 10.1109/ICASSP.1999.758074
L.R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition Proceedings of the IEEE. ,vol. 77, pp. 267- 296 ,(1989) , 10.1109/5.18626
P.J. Moreno, B. Raj, R.M. Stern, A vector Taylor series approach for environment-independent speech recognition international conference on acoustics speech and signal processing. ,vol. 2, pp. 733- 736 ,(1996) , 10.1109/ICASSP.1996.543225
R. Kuhn, R. De Mori, A cache-based natural language model for speech recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 12, pp. 570- 583 ,(1990) , 10.1109/34.56193
Kuansan Wang, S. Shamma, Self-normalization and noise-robustness in early auditory representations IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 421- 435 ,(1994) , 10.1109/89.294356
N.Z. Tisby, On the application of mixture AR hidden Markov models to text independent speaker recognition IEEE Transactions on Signal Processing. ,vol. 39, pp. 563- 570 ,(1991) , 10.1109/78.80876
Richard Sproat, Chilin Shih, Jan P. H. van Santen, Yuri Pavlov, Elena Pavlova, Bell laboratories Russian text-to-speech system. conference of the international speech communication association. ,(1997)
Robert M. Gray, Toeplitz and circulant matrices ,(1977)