Time synchronous decoding for long-span hidden trajectory model

作者: Li Deng , Dong Yu , Alejandro Acero , Xiaolong Li

DOI:

关键词:

摘要: A time-synchronous lattice-constrained search algorithm is developed and used to process a linguistic model of speech that has long-contextual-span capability. In the algorithm, hypotheses are represented as traces include an indication current frame, previous frames future frames. Each frame can associated unit such phone or units derived from phone. Additionally, pruning strategies be applied speed up search. Further, word-ending recombination methods computation. These effectively deal with exponentially increased space.

参考文章(34)
Volker Steinbiss, Bach-Hiep Tran, Hermann Ney, Improvements in beam search. conference of the international speech communication association. ,(1994)
Achim Sixtus, Hermann Ney, Across-word phoneme models for large vocabulary continuous speech recognition Publikationsserver der RWTH Aachen University. ,(2003)
Eiichi Tsuboka, Pattern recognition apparatus ,(1991)
Li Deng, Dong Yu, Alejandro Acero, Quantitative model for formant dynamics and contextually assimilated reduction in fluent speech conference of the international speech communication association. ,(2004)
Konstantinos Koumpis, Soren Riis, Speech recognition method and system ,(2001)
Philip Neil Garner, Asako Higuchi, Jason Peter Andrew Charlesworth, Pattern matching method and apparatus ,(2000)
Matthew Lennig, Vishwa Nath Gupta, Speech recognition method using a two-pass search ,(1994)