The Dialog State Tracking Challenge

作者: Antoine Raux , Alan Black , Jason Williams , Deepak Ramachandran

DOI:

关键词:

摘要: In a spoken dialog system, state tracking deduces information about the user’s goal as progresses, synthesizing evidence such acts over multiple turns with external data sources. Recent approaches have been shown to overcome ASR and SLU errors in some applications. However, there are currently no common testbeds or evaluation measures for this task, hampering progress. The challenge seeks address by providing heterogeneous corpus of 15K human-computer dialogs standard format, along suite 11 metrics. received total 27 entries from 9 research groups. results show that performance metrics cluster into 4 natural Moreover, systems benefit most those less discriminative speech recognition confidence scores. Finally, generalization is key problem: 2 test sets, fewer than half out-performed simple baselines. 1 Overview motivation Spoken interact users via language help them achieve goal. As interaction manager maintains representation process called (DST). For example, bus schedule might indicate desired route, origin, destination. Dialog difficult because automatic ∗Most work was performed when second third authors were Honda Research Institute, Mountain View, CA, USA (ASR) understanding (SLU) common, can cause system misunderstand needs. At same time, crucial relies on estimated choose actions – which present user. Most commercial use hand-crafted heuristics tracking, selecting result highest score, discarding alternatives. contrast, statistical compute scores many hypotheses (Figure 1). By exploiting correlations between sources maps, timetables, models past errors. Numerous techniques proposed, including heuristic (Higashinaka et al., 2003), Bayesian networks (Paek Horvitz, 2000; Williams Young, 2007), kernel density estimators (Ma 2012), (Bohus Rudnicky, 2006). Techniques fielded scale realistically sized problems operate real time (Young 2010; Thomson Williams, Mehta 2010). end-to-end systems, has improve overall Despite progress, direct comparisons methods not possible studies different domains components, recognition, understanding, control, etc. little agreement how evaluate tracking. Together these issues limit progress area. State Tracking Challenge (DSTC) provides first testbed

参考文章(14)
Antoine Raux, Neville Mehta, Stefan Krawczyk, Rakesh Gupta, Deepak Ramachandran, Probabilistic Ontology Trees for Belief Tracking in Dialog Systems annual meeting of the special interest group on discourse and dialogue. pp. 37- 46 ,(2010)
Antoine Raux, Rakesh Gupta, Yi Ma, Deepak Ramachandran, Landmark-Based Location Belief Tracking in a Spoken Dialog System annual meeting of the special interest group on discourse and dialogue. pp. 169- 178 ,(2012)
Alexander I Rudnicky, Dan Bohus, A “K Hypotheses + Other” Belief Updating Model AAAI Press. ,(2006)
Eric Horvitz, Tim Paek, Conversation as Action Under Uncertainty uncertainty in artificial intelligence. pp. 455- 464 ,(2000)
Alan W Black, Susanne Burger, Brian Langner, Gabriel Parent, Maxine Eskenazi, Spoken Dialog Challenge 2010 spoken language technology workshop. pp. 448- 453 ,(2010) , 10.1109/SLT.2010.5700894
M. G. KENDALL, A NEW MEASURE OF RANK CORRELATION Biometrika. ,vol. 30, pp. 81- 93 ,(1938) , 10.1093/BIOMET/30.1-2.81
Jason D. Williams, Incremental partition recombination for efficient tracking of multiple dialog states international conference on acoustics, speech, and signal processing. pp. 5382- 5385 ,(2010) , 10.1109/ICASSP.2010.5494939
Steve Young, Milica Gašić, Simon Keizer, François Mairesse, Jost Schatzmann, Blaise Thomson, Kai Yu, The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management Computer Speech & Language. ,vol. 24, pp. 150- 174 ,(2010) , 10.1016/J.CSL.2009.04.001
Blaise Thomson, Steve Young, Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems Computer Speech & Language. ,vol. 24, pp. 562- 588 ,(2010) , 10.1016/J.CSL.2009.07.003
Ryuichiro Higashinaka, Mikio Nakano, Kiyoaki Aikawa, Corpus-based discourse understanding in spoken dialogue systems Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - ACL '03. pp. 240- 247 ,(2003) , 10.3115/1075096.1075127