A tractable hybrid ddn–pomdp approach to affective dialogue modeling for probabilistic frame-based dialogue systems

作者: TRUNG H. BUI , MANNES POEL , ANTON NIJHOLT , JOB ZWIERS

DOI: 10.1017/S1351324908005032

关键词: Domain (software engineering)Action (philosophy)Probabilistic logicComputer scienceCrisis managementArtificial intelligenceState (computer science)Frame (networking)Frame basedPartially observable Markov decision process

摘要: We propose a novel approach to developing tractable affective dialogue model for probabilistic frame-based systems. The model, based on Partially Observable Markov Decision Process (POMDP) and Dynamic Network (DDN) techniques, is composed of two main parts: the slot-level manager global manager. It has new features: (1) being able deal with large number slots (2) take into account some aspects user's state in deriving adaptive strategies. Our implemented prototype can handle hundreds slots, where each individual slot might have values. illustrated through route navigation example crisis management domain. conducted various experiments evaluate our compare it approximate POMDP techniques handcrafted policies. experimental results showed that DDN–POMDP policy outperforms three policies when action error induced by stress as well observation increases. Further, performance one-step look-ahead after optimizing its internal reward close state-of-the-art counterparts.

参考文章(50)
Thomas Dean, Keiji Kanazawa, A model for projection and action international joint conference on artificial intelligence. pp. 985- 990 ,(1989)
Alexander Waibel, Thomas S. Polzin, EMOTION-SENSITIVE HUMAN-COMPUTER INTERFACES ,(2000)
S Young, JD Williams, Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI) Association for the Advancement of Artificial Intelligence. ,(2006)
David R. Traum, Staffan Larsson, The Information State Approach to Dialogue Management Springer, Dordrecht. pp. 325- 353 ,(2003) , 10.1007/978-94-010-0019-2_15
Craig Boutilier, David Poole, Computing optimal policies for partially observable decision processes using compact representations national conference on artificial intelligence. pp. 1168- 1175 ,(1996)
Eric Chang, Baining Guo, Jianfeng Mao, Qingsheng Cai, Bo Zhang, Spoken Dialogue Management as Planning and Acting under Uncertainty conference of the international speech communication association. pp. 2169- 2172 ,(2001)
Damien Ernst, Arthur Louette, Introduction to Reinforcement Learning MIT Press. ,(1998)
Sebastian Thrun, Joelle Pineau, Geoff Gordon, Point-based value iteration: an anytime algorithm for POMDPs international joint conference on artificial intelligence. pp. 1025- 1030 ,(2003)
Michael L. Littman, Anthony R. Cassandra, Leslie Pack Kaelbling, Learning policies for partially observable environments: Scaling up Machine Learning Proceedings 1995. pp. 362- 370 ,(1995) , 10.1016/B978-1-55860-377-6.50052-9