Statistical methods in natural language understanding and spoken dialogue systems

Authors: Hermann Ney, Klaus Macherey

DOI:

Keywords: Formal language, Natural language processing, Computer science, Artificial intelligence, Machine translation, Generative model, Rule-based machine translation, Feature (machine learning), Word error rate, Natural language understanding, Sentence

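Abstract: Modern automatic spoken dialogue systems cover a wide range of applications. There are systems for hotel reservations, restaurant guides, travel and timetable information, as well as telephone-banking services. Building the different components of a spoken dialogue system and combining them in an optimal way such that a reasonable dialogue becomes possible is a complex task because, during the course of a dialogue, the system has to deal with uncertain information. In this thesis, we use statistical methods to model and combine the system’s components. Statistical methods provide a well-founded theory for modeling systems where decisions have to be made under uncertainty. Starting from Bayes’ decision rule, we define and evaluate various statistical models for these components, which comprise speech recognition, natural language understanding, and dialogue management. The problem of natural language understanding is described as a special machine translation problem in which a source sentence is translated into a formal-language target sentence consisting of concepts. For this, we define and evaluate two models. The first model is a generative model based on the source-channel paradigm. Because the word context plays an important role in natural language understanding tasks, we use a phrase-based translation system in order to take local context dependencies into account. The second model is a direct model based on the maximum entropy framework and works similarly to a tagger. For the direct model, we define several feature functions that capture dependencies between words and concepts. Both methods have the advantage that only source-target pairs in the form of input-output sentences must be provided for training. Thus, there is no need to generate grammars manually, which significantly reduces the costs of building dialogue systems for new domains. Furthermore, we propose and investigate a framework based on minimum error rate training that results in a tighter coupling between speech recognition and language understanding. This framework allows for an easy integration of multiple knowledge sources by minimizing the overall error criterion. Thus, it is possible to add language understanding features to the speech recognition framework and thus minimize the word error rate, or to add speech recognition features to the language understanding framework and thus minimize the slot error rate. Finally, we develop a task-independent dialogue manager using trees as the fundamental data structure. Based on a cost function, the dialogue manager chooses the next dialogue action with minimal costs. The design and the task-independence of the dialogue manager lead to a strict separation of a given application and the operations performed by the dialogue manager, which simplifies porting an existing dialogue system to a new domain. We report results from a field test in which the dialogue manager was able to choose the optimal dialogue action in 90% of the decisions. We also investigate techniques for error handling based on confidence measures defined for speech recognition and language understanding, and the overall performance of the dialogue system when these confidence measures are incorporated into the dialogue strategy. Experiments have been carried out on the TelDir database, a German in-house telephone directory assistance corpus, and on the Taba train time scheduling task.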
