Updated MINDS Report on Speech Recognition and Understanding

Authors: S. Khudanpur, Li Deng, N. Morgan, J. Glass, C.-H. Lee


Abstract: This article is the second part of an updated version of the “MINDS 2006–2007 Report of the Speech Understanding Working Group,” one of five reports emanating from two workshops entitled “Meeting of the MINDS: Future Directions for Human Language Technology,” sponsored by the U.S. Disruptive Technology Office (DTO). (MINDS is an acronym for “machine translation, information retrieval, natural-language processing, data resources, and speech understanding.”) For further information, please see http://www.itl.nist.gov/iaui/894.02/minds.html.


