Speech recognition: Statistical methods

Authors: L.R. Rabiner, B.-H. Juang

DOI: 10.1016/B0-08-044854-2/00907-X

Abstract: Statistical methods for speech processing refer to a general methodology in which knowledge about both the speech signal and the language that it expresses, along with practical uses of that knowledge for specific tasks or services, is developed from actual realizations of speech data through a well-defined mathematical and statistical formalism. For more than 20 years, this basic methodology has produced many advances and new results, particularly for recognizing and understanding speech and natural language by machine. In this article, we focus on two important statistical methods: one based primarily on the hidden Markov model formulation, which has gained widespread acceptance as the dominant technique for characterizing the variation in the acoustic signal representing speech, and one related to the use of statistics for characterizing word co-occurrences. This second model acts as a form of grammar, or set of syntactical constraints, on the language. In contrast to earlier speech recognition systems that employed linguistic analyses, these data-driven statistical methods have proven to produce consistent and useful results and have become the underpinning technology of modern speech recognition and understanding systems. Such systems are used in a wide range of applications, such as automatic telephone call routing and information retrieval.
