Updated MINDS Report on Speech Recognition and Understanding

作者: S. Khudanpur , Li Deng , N. Morgan , J. Glass , C.-H. Lee

DOI:

关键词:

摘要: This article is the second part of an updated version “MINDS 2006–2007 Report Speech Understanding Working Group,” one five reports emanating from two workshops entitled “Meeting MINDS: Future Directions for Human Language Technology,” sponsored by U.S. Disruptive Technology Office (DTO). (MINDS acronym “machine translation, information retrieval,  natural-language processing, data resources, and speech understanding.”) For further information, please see http://www.itl.nist.gov/iaui/894.02/ minds.html.

参考文章(50)
Steven Pinker, The Language Instinct ,(1994)
J. S. George, C. J. Aine, J. C. Mosher, D. M. Schmidt, D. M. Ranken, H. A. Schlitt, C. C. Wood, J. D. Lewine, J. A. Sanders, J. W. Belliveau, Mapping function in the human brain with magnetoencephalography, anatomical magnetic resonance imaging, and functional magnetic resonance imaging. Journal of Clinical Neurophysiology. ,vol. 12, pp. 406- 431 ,(1995) , 10.1097/00004691-199509010-00002
Li Deng, Dong Yu, A. Acero, Structured speech modeling IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 14, pp. 1492- 1504 ,(2006) , 10.1109/TASL.2006.878265
P.W. Jusczyk, R.N. Aslin, Infants′ Detection of the Sound Patterns of Words in Fluent Speech Cognitive Psychology. ,vol. 29, pp. 1- 23 ,(1995) , 10.1006/COGP.1995.1010
M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cretin, H. Bourlard, M. Athineos, N. Morgan, Qifeng Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, Pushing the envelope - aside [speech recognition] IEEE Signal Processing Magazine. ,vol. 22, pp. 81- 88 ,(2005) , 10.1109/MSP.2005.1511826
Jenny R Saffran, Constraints on Statistical Language Learning Journal of Memory and Language. ,vol. 47, pp. 172- 196 ,(2002) , 10.1006/JMLA.2001.2839
M. Ostendorf, V.V. Digalakis, O.A. Kimball, From HMM's to segment models: a unified view of stochastic modeling for speech recognition IEEE Transactions on Speech and Audio Processing. ,vol. 4, pp. 360- 378 ,(1996) , 10.1109/89.536930
S. Axelrod, B. Maison, Combination of hidden Markov models with dynamic time warping for speech recognition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 173- 176 ,(2004) , 10.1109/ICASSP.2004.1325950
J.-L. Gauvain, Chin-Hui Lee, Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 291- 298 ,(1994) , 10.1109/89.279278