The thoughtful elephant: strategies for spoken dialog systems

Authors: B. Souvignier, A. Kellner, B. Rueber, H. Schramm, F. Seide

DOI: 10.1109/89.817453

Keywords:

Abstract: We present technology used in spoken dialog systems for applications of a wide range. They include tasks from the travel domain and automatic switchboards as well as large scale directory assistance. The overall goal in developing spoken dialog systems is to allow for a natural and flexible dialog flow similar to human-human interaction. This imposes the challenging task to recognize and interpret user input, where he/she is allowed to choose from an unrestricted vocabulary and an infinite set of possible formulations. We therefore put emphasis on strategies that make the system more robust while still maintaining a high level of naturalness and flexibility. In view of this paradigm, we found that two fundamental principles characterize many of the proposed methods: to consider all available sources of information as early as possible; and to keep alternative hypotheses and delay the decision for a single option as long as possible. We describe how our system architecture caters for incorporating application specific knowledge, including, for example, database constraints, in the determination of the best sentence hypothesis for a user turn. On the next higher level, we use the dialog history to assess the plausibility of a sentence hypothesis by applying consistency checks with information items from previous user turns. In particular, we demonstrate how the combination of decisions over several turns can be exploited to boost the recognition performance of the system.

References (34)
R. Kneser, J. Peters, Semantic clustering for adaptive language modeling, International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 779-782 (1997), 10.1109/ICASSP.1997.596041
A. Stolcke, J. Segal, Precise n-gram probabilities from stochastic context-free grammars, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, pp. 74-79 (1994), 10.3115/981732.981743
M. Weintraub, LVCSR log-likelihood ratio scoring for keyword spotting, International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 297-300 (1995), 10.1109/ICASSP.1995.479532
R. Billi, F. Canavesio, C. Rullent, Automation of Telecom Italia directory assistance service: field trial results, Proceedings 1998 IEEE 4th Workshop Interactive Voice Technology for Telecommunications Applications (IVTTA '98, Cat. No. 98TH8376), pp. 11-16 (1998), 10.1109/IVTTA.1998.727685
A. Kellner, F. Seide, B. Rueber, With a little help from the database - developing voice-controlled directory information systems, IEEE Automatic Speech Recognition and Understanding Workshop, pp. 566-574 (1997), 10.1109/ASRU.1997.659137
A.R. Setlur, R.A. Sukkar, J. Jacob, Correcting recognition errors via discriminative utterance verification, International Conference on Spoken Language Processing, vol. 2, pp. 602-605 (1996), 10.1109/ICSLP.1996.607433
A. Kellner, B. Rueber, H. Schramm, Strategies for name recognition in automatic directory assistance systems, Proceedings 1998 IEEE 4th Workshop Interactive Voice Technology for Telecommunications Applications (IVTTA '98, Cat. No. 98TH8376), pp. 21-26 (1998), 10.1109/IVTTA.1998.727687
A. Kellner, Initial language models for spoken dialogue systems, International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 185-188 (1998), 10.1109/ICASSP.1998.674398
M. Oerder, H. Ney, Word graphs: an efficient interface between continuous-speech recognition and language understanding, IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 119-122 (1993), 10.1109/ICASSP.1993.319246
A.L. Gorin, B.A. Parker, R.M. Sachs, J.G. Wilpon, How may I help you, Speech Communication, vol. 23, pp. 113-127 (1997), 10.1016/S0167-6393(97)00040-X