Strategies for lexical access to very large vocabularies

作者: L. Fissore , G. Micca , R. Pieraccini , P. Palace

DOI: 10.1016/0167-6393(88)90051-9

关键词: Artificial intelligenceHidden Markov modelSpeech recognitionGraph (abstract data type)Word (computer architecture)Viterbi algorithmVocabularyTree structureComputer scienceNatural language processingWord error rateWord recognition

摘要: Abstract A large vocabulary isolated word recognition system is described on a two pass strategy: hypothesization and verification. Word preselection achieved by segmenting classifying the input signal in terms of 6 broad phonetic classes. To reduce storage computational costs, lexical knowledge organized tree structure where initial common subsequences descriptions are shared, beam-search Dynamic Programming algorithm carries most promising paths only. In second pass, verification, detailed representation phonemic candidates used for estimating likely words. Each candidate modeled graph subword Hidden Markov Models. Again, tree-structure whole subset built online an efficient implementation Viterbi that estimates likelihood candidates. The results show complexity reduction about 73% can be using approach with respect to direct approach, while accuracy remains comparable.

参考文章(24)
Pietro Laface, G. Micca, R. Pieraccini, Three dimensional DP for phonetic lattice matching international conference on digital signal processing. pp. 547- 551 ,(1987)
P. Laface, G. Micca, R. Pieraccini, Recognition of words in very large vocabulary Proceedings of the NATO Advanced Study Institute on Recent advances in speech understanding and dialog systems. pp. 235- 254 ,(1988) , 10.1007/978-3-642-83476-9_21
L. Fissore, E. Giachin, P. Laface, G. Micca, R. Pieraccini, C. Rullent, Experimental results on large-vocabulary continuous speech recognition and understanding international conference on acoustics speech and signal processing. pp. 414- 417 ,(1988) , 10.1109/ICASSP.1988.196606
P. Laface, G. Micca, R. Pieraccini, Experimental results on a large lexicon access task international conference on acoustics, speech, and signal processing. ,vol. 12, pp. 809- 812 ,(1987) , 10.1109/ICASSP.1987.1169759
R. Billi, G. Massia, F. Nesti, Word preselection for large vocabulary speech recognition international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 65- 68 ,(1986) , 10.1109/ICASSP.1986.1169181
M. Cravero, R. Pieraccini, F. Raineri, Definition and evaluation of phonetic units for speech recognition by hidden Markov models international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 2235- 2238 ,(1986) , 10.1109/ICASSP.1986.1168550
A. Waibel, Prosodic knowledge sources for word hypothesization in a continuous speech recognition system international conference on acoustics, speech, and signal processing. ,vol. 12, pp. 534- 537 ,(1987) , 10.1109/ICASSP.1987.1169848
G. Schukat-Talamazzini, H. Niemann, Generating word hypotheses in continuous speech international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 1565- 1568 ,(1986) , 10.1109/ICASSP.1986.1168946
V. Gupta, M. Lennig, P. Mermelstein, Integration of acoustic information in a large vocabulary word recognizer international conference on acoustics, speech, and signal processing. ,vol. 12, pp. 697- 700 ,(1987) , 10.1109/ICASSP.1987.1169578