Hierarchical Reinforcement Learning for Pedagogical Policy Induction

作者: Guojing Zhou , Hamoon Azizsoltani , Markel Sanz Ausin , Tiffany Barnes , Min Chi

DOI: 10.1007/978-3-030-23204-7_45

关键词:

摘要: In interactive e-learning environments such as Intelligent Tutoring Systems, there are pedagogical decisions to make at two main levels of granularity: whole problems and single steps. Recent years have seen growing interest in data-driven techniques for decision making, which can dynamically tailor students’ learning experiences. Most existing approaches, however, treat these equally, or independently, disregarding the long-term impact that tutor may across granularity. this paper, we propose apply an offline, off-policy Gaussian Processes based Hierarchical Reinforcement Learning (HRL) framework induce a hierarchical policy makes both problem step levels. empirical classroom study with 180 students, our results show HRL is significantly more effective than Deep Q-Network (DQN) induced random yet reasonable baseline policy.

参考文章(42)
Martha Evens, Joel Michael, One-on-One Tutoring by Humans and Computers Psychology Press. ,(2006) , 10.4324/9781410617071
Bruce M. McLaren, Seiji Isotani, When is it best to learn with all worked examples artificial intelligence in education. pp. 222- 229 ,(2011) , 10.1007/978-3-642-21869-9_30
Amir Shareghi Najar, Antonija Mitrovic, Bruce M. McLaren, Adaptive Support versus Alternating Worked Examples and Tutored Problems: Which Leads to Better Learning? international conference on user modeling, adaptation, and personalization. pp. 171- 182 ,(2014) , 10.1007/978-3-319-08786-3_15
Malcolm R. K. Ryan, Mark Reid, Learning to Fly: An Application of Hierarchical Reinforcement Learning international conference on machine learning. pp. 807- 814 ,(2000)
Bruce M. McLaren, Tamara van Gog, Craig Ganoe, David Yaron, Michael Karabinos, Exploring the assistance dilemma: Comparing instructional support in examples and problems intelligent tutoring systems. pp. 354- 361 ,(2014) , 10.1007/978-3-319-07221-0_44
Carl Edward Rasmussen, Gaussian processes in machine learning Lecture Notes in Computer Science. pp. 63- 71 ,(2003) , 10.1007/978-3-540-28650-9_4
Beverly Park Woolf, Carole R. Beal, Joseph Beck, Advisor: a machine-learning architecture for intelligent tutor construction national conference on artificial intelligence. pp. 552- 557 ,(2000)
Computers as Cognitive Tools L. Erlbaum Associates Inc.. ,(1993) , 10.4324/9780203052594
Andrew G. Barto, Sridhar Mahadevan, Recent Advances in Hierarchical Reinforcement Learning Discrete Event Dynamic Systems. ,vol. 13, pp. 41- 77 ,(2003) , 10.1023/A:1022140919877
Marvin Croy, Michael Eagle, Tiffany Barnes, John C. Stamper, Experimental evaluation of automatic hint generation for a logic tutor artificial intelligence in education. ,vol. 22, pp. 345- 352 ,(2011) , 10.3233/JAI-130029