Distributed speaker adaptation

作者: Petar Aleksic , Xin Lei

DOI:

关键词:

摘要: Automatic speech recognition (ASR) may be performed on received utterances. The ASR by an module of a computing device (e.g., client device). include: generating feature vectors based the utterances, updating feature-space speaker adaptation parameters, transcribing utterances to text strings, and parameters vectors. transcriptions based, at least in part, acoustic model updated Updated from another incorporated into module.

参考文章(152)
Joshua T. Goodman, A Bit of Progress in Language Modeling Extended Version Microsoft Research. pp. 72- ,(2001)
PC Woodland, Speaker adaptation for continuous density HMMs: a review ISCA: International Speech Communication Association. ,(2001)
Mehryar Mohri, Weighted Finite-State Transducer Algorithms. An Overview Formal Languages and Applications. pp. 551- 563 ,(2004) , 10.1007/978-3-540-39886-8_29
Tu Van Le, Dat Tran, Michael Wagner, Fuzzy Gaussian mixture models for speaker recognition. conference of the international speech communication association. ,(1998)
Xiaodong He, Jonathan Hamaker, Xin Lei, Patrick Nguyen, Speech recognition using adaptation and prior knowledge ,(2005)
Etienne Marcheret, Yuqing Gao, Hakan Erdogan, Yongxin Li, Incremental on-line feature space MLLR adaptation for telephony speech recognition. conference of the international speech communication association. ,(2002)
Xiaodong He, Xin Lei, Jon Hamaker, Robust feature space adaptation for telephony speech recognition. conference of the international speech communication association. ,(2006)
Toru Imai, Richard M. Schwartz, Topic indexing method ,(1998)
Michael Riley, Fernando Pereira, Andrej Ljolje, Efficient general lattice generation and rescoring. conference of the international speech communication association. ,(1999)
Mark Epstein, Hierarchical language models ,(2001)