Implications of Human Irrationality for Reinforcement Learning

Authors: Hyung Jin Chang, Andrew Howes, Haiyang Chen

Abstract: Recent work in the behavioural sciences has begun to overturn the long-held belief that human decision making is irrational, suboptimal and subject to biases. This turn to the …

References (63)
Warren S. McCulloch, Walter Pitts, A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biology, vol. 52, pp. 99-115 (1990), 10.1007/BF02478259
Ivo Vlaev, Nick Chater, Neil Stewart, Gordon D.A. Brown, Does the brain calculate value? Trends in Cognitive Sciences, vol. 15, pp. 546-554 (2011), 10.1016/J.TICS.2011.09.008
P. Dayan, N. D. Daw, Decision theory, reinforcement learning, and the brain. Cognitive, Affective, & Behavioral Neuroscience, vol. 8, pp. 429-453 (2008), 10.3758/CABN.8.4.429
Rajesh P. N. Rao, Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes. Frontiers in Computational Neuroscience, vol. 4, p. 146 (2010), 10.3389/FNCOM.2010.00146
Jennifer S. Trueblood, Multialternative context effects obtained using an inference task. Psychonomic Bulletin & Review, vol. 19, pp. 962-968 (2012), 10.3758/S13423-012-0288-9
Amos Tversky, Itamar Simonson, Context-Dependent Preferences. Management Science, vol. 39, pp. 1179-1189 (1993), 10.1287/MNSC.39.10.1179
Nathaniel D. Daw, Aaron C. Courville, David S. Touretzky, Representation and timing in theories of the dopamine system. Neural Computation, vol. 18, pp. 1637-1677 (2006), 10.1162/NECO.2006.18.7.1637
Douglas H. Wedell, Distinguishing Among Models of Contextually Induced Preference Reversals. Journal of Experimental Psychology: Learning, Memory and Cognition, vol. 17, pp. 767-778 (1991), 10.1037/0278-7393.17.4.767
Peter Frazier, Angela J Yu, Sequential Hypothesis Testing under Stochastic Deadlines. Neural Information Processing Systems, vol. 20, pp. 465-472 (2007)