Detection-Based Decoder

Author: Qi Li

DOI: 10.1007/978-3-642-23731-7_6

Keywords: Sequential probability ratio test; Beam diameter; Path (graph theory); Computer science; Speech recognition; Task (project management); Speaker recognition; Hidden Markov model; Decoding methods; Viterbi algorithm

Abstract: Decoding or searching is an important task in both speaker and speech recognition. In speaker verification (SV), given a spoken password and a speaker-dependent hidden Markov model (HMM), the task of decoding is to find the optimal state alignments in the sense of the maximum likelihood score of the entire utterance. Currently, the most popular decoding algorithm is the Viterbi algorithm with a pre-defined beam width to reduce the search space; however, it is difficult to determine a suitable beam width beforehand. A small beam width may miss the optimal path, while a large one may slow down the process. To address the problem, the author has developed a non-heuristic algorithm to reduce the search space. The details are presented in this chapter.
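The beam-width trade-off described in the abstract can be illustrated with a minimal sketch of conventional beam-pruned Viterbi decoding. This is the baseline the abstract refers to, not the author's detection-based algorithm, whose details are not given on this page. The function name `viterbi_beam`, the score-margin form of the beam, and the two-state toy scores are all assumptions made for illustration.

```python
NEG_INF = float("-inf")

def viterbi_beam(log_init, log_trans, log_emit, beam_width=None):
    """Return (best_log_score, best_state_path) for one utterance.

    log_init[s]     : log score of starting in state s
    log_trans[p][s] : log score of moving from state p to state s
    log_emit[t][s]  : log score of frame t under state s
    beam_width      : states scoring more than this many log units below
                      the frame's best are pruned; None disables pruning.
    """
    n_states = len(log_init)
    scores = [log_init[s] + log_emit[0][s] for s in range(n_states)]
    backptrs = []
    for t in range(1, len(log_emit)):
        if beam_width is not None:
            best = max(scores)
            # Prune: a state outside the beam can no longer be extended.
            scores = [v if v >= best - beam_width else NEG_INF for v in scores]
        new_scores, ptrs = [], []
        for s in range(n_states):
            prev = max(range(n_states), key=lambda p: scores[p] + log_trans[p][s])
            new_scores.append(scores[prev] + log_trans[prev][s] + log_emit[t][s])
            ptrs.append(prev)
        scores = new_scores
        backptrs.append(ptrs)
    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda s: scores[s])
    best_score = scores[state]
    path = [state]
    for ptrs in reversed(backptrs):
        state = ptrs[state]
        path.append(state)
    path.reverse()
    return best_score, path

# Hypothetical toy model: two states that cannot switch into each other.
# State 1 scores poorly on the first frame but well on the rest.
log_init = [0.0, 0.0]
log_trans = [[0.0, NEG_INF], [NEG_INF, 0.0]]
log_emit = [[-1.0, -4.0], [-5.0, -1.0], [-5.0, -1.0]]

full = viterbi_beam(log_init, log_trans, log_emit)                   # no pruning
narrow = viterbi_beam(log_init, log_trans, log_emit, beam_width=2.0)
wide = viterbi_beam(log_init, log_trans, log_emit, beam_width=5.0)
```

In this toy case the narrow beam discards state 1 after the first frame and so returns a worse path than the unpruned search, while the wider beam recovers the same path as the full search at the cost of keeping more states alive. A suitable width depends on the data, which is the difficulty the abstract points to.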

References (28)
Qi Li. A fast decoding algorithm based on sequential detection of the changes in distribution. Conference of the International Speech Communication Association (1998).
Dimitri Kazakos, P. Papantoni-Kazakos. Detection and Estimation (1989).
B. E. Brodsky, B. S. Darkhovsky. Nonparametric methods in change-point problems. Kluwer Academic Publishers (1993). DOI: 10.1007/978-94-015-8163-9
John G. Proakis, John R. Deller, John H. Hansen. Discrete-Time Processing of Speech Signals (1993).
L.R. Bahl, R. Bakis, J. Bellegarda, P.F. Brown, D. Burshtein, S.K. Das, P.V. de Souza, P.S. Gopalakrishnan, F. Jelinek, D. Kanevsky, R.L. Mercer, A.J. Nadas, D. Nahamoo, M.A. Picheny. Large vocabulary natural language continuous speech recognition. International Conference on Acoustics, Speech, and Signal Processing, vol. 26, pp. 465-467 (1989). DOI: 10.1109/ICASSP.1989.266464
Wayne A. Lea. Trends in Speech Recognition. Prentice Hall PTR (1980).
S. Parthasarathy, A.E. Rosenberg. General phrase speaker verification using sub-word background models and likelihood-ratio scoring. International Conference on Spoken Language Processing, vol. 4, pp. 2403-2406 (1996). DOI: 10.1109/ICSLP.1996.607293
Sriram Srinivasan, Ashish Vijay Pandharipande. Speech signal processing (2009).
H. Ney, R. Haeb-Umbach, B.-H. Tran, M. Oerder. Improvements in beam search for 10000-word continuous speech recognition. International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 9-12 (1992). DOI: 10.1109/ICASSP.1992.225985