Evaluation of the CMU ATIS system

Author: Wayne Ward

DOI: 10.3115/112405.112419

Abstract: The CMU Phoenix system is an experiment in understanding spontaneous speech. It has been implemented for the Air Travel Information Service (ATIS) task. In this task, casual users are asked to obtain information from a database of air travel information. Users are not given a vocabulary, grammar or set of sentences to read. They compose queries themselves in a spontaneous manner. This task presents speech recognizers with many new problems compared to the Resource Management task. Not only is the speech not fluent, but the vocabulary and grammar are open. Also, the task is not just to produce a transcription, but to produce an action: retrieve data from the database. Taking such actions requires parsing and "understanding" the utterance. Word error rate is not as important as utterance understanding rate.

Phoenix attempts to deal with phenomena that occur in spontaneous speech. Unknown words, restarts, repeats, and poorly formed or unusual grammar are common in spontaneous speech and are very disruptive to standard recognizers. These events lead to misrecognitions, which often cause a total parse failure. Our strategy is to apply grammatical constraints at the phrase level and to use semantic rather than lexical grammars. Semantics provide more constraint than parts of speech and must ultimately be dealt with in order to take actions. Applying constraints at the phrase level is more flexible than recognizing sentences as a whole, while providing much more constraint than word-spotting. Restarts and repeats are most often between phrase occurrences, so individual phrases can still be recognized correctly. Poorly constructed grammar often consists of well-formed phrases and is often semantically well-formed; it is only syntactically incorrect. We associate phrases by frame-based semantics. Phrases represent word strings that can fill slots in frames. The slots represent information which the frame is able to act on.

The current Phoenix system uses a bigram language model with the Sphinx speech recognition system. The top-scoring word string is passed to a flexible frame-based parser. The parser assigns phrases (word strings) from the input to slots in frames. The slots represent information content needed for the frame. A beam of frame hypotheses is produced, and the best scoring one is used to produce an SQL query.
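The pipeline the abstract describes (phrases fill slots in frames, a beam of frame hypotheses is kept, and the best one becomes an SQL query) can be illustrated with a minimal Python sketch. This is not the Phoenix implementation. The frame names, slot patterns, table and column names, the word-coverage score, and the choice to keep the last match of a repeated phrase are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass

# Hypothetical frames: each maps slot names to a phrase-level pattern.
# Real semantic grammars would be far richer than these regexes.
FRAMES = {
    "flight": {
        "head": r"\bflights?\b",
        "origin": r"\bfrom (\w+)",
        "destination": r"\bto (\w+)",
    },
    "fare": {
        "head": r"\b(?:fares?|cost)\b",
        "origin": r"\bfrom (\w+)",
        "destination": r"\bto (\w+)",
    },
}


@dataclass
class Hypothesis:
    frame: str
    slots: dict
    score: int  # assumed score: number of input words covered by filled slots


def fill_frame(frame, patterns, text):
    """Fill as many slots as possible; unmatched words are simply skipped."""
    slots, covered = {}, 0
    for slot, pattern in patterns.items():
        matches = list(re.finditer(pattern, text))
        if not matches:
            continue
        m = matches[-1]  # assumption: after a restart, keep the later phrase
        slots[slot] = m.group(m.lastindex or 0)
        covered += len(m.group(0).split())
    return Hypothesis(frame, slots, covered)


def parse(text, beam_width=2):
    """Return the best-scoring frame hypotheses, best first."""
    hyps = [fill_frame(name, pats, text) for name, pats in FRAMES.items()]
    return sorted(hyps, key=lambda h: h.score, reverse=True)[:beam_width]


def to_sql(hyp):
    """Build a parameterized query against a hypothetical table per frame."""
    conds = {k: v for k, v in hyp.slots.items() if k != "head"}
    where = " AND ".join(f"{col} = ?" for col in conds)
    sql = f"SELECT * FROM {hyp.frame}" + (f" WHERE {where}" if where else "")
    return sql, list(conds.values())


# A disfluent utterance with a filler word and a restart.
utterance = "show me uh flights from boston from pittsburgh to denver"
beam = parse(utterance)
query, params = to_sql(beam[0])
```

Because each slot is matched independently, the filler "uh" and the abandoned phrase "from boston" do not cause the whole utterance to fail, which is the point the abstract makes about applying constraints at the phrase level instead of to the sentence as a whole.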
