Application of a unifying reward-prediction error (RPE)-based framework to explain underlying dynamic dopaminergic activity in timing tasks

作者: Allison E. Hamilos , John A. Assad

DOI: 10.1101/2020.06.03.128272

关键词:

摘要: This manuscript is intended as a theoretical companion to Hamilos et al., 2020, in which we examined the role of dopaminergic neurons (DANs) self-timed movements. In that study, recorded DAN signals mice trained initiate licking movement after delay following start-timing cue. both before cue and during timing interval predicted onset, up seconds itself. particular, "ramped up" from time movement. On given trial, slope ramping was predictive when would occur, with steep associated early shallow late movement, reminiscent ramp-to-threshold process. Ramping were recently proposed framework temporal-difference learning under resolved state uncertainty (Mikhael 2019; Mikhael & Gershman, 2014). Here, show an adapted version al.9s model recapitulates signaling observed our task. We also applied results reported recent temporal bisection categorized intervals relatively short or long compared criterion (Soares 2016). The successfully relative amplitude dynamic These combined suggest common neural mechanism broadly underlies behavior: trial-by-trial variation rate internal "pacemaker," manifested reflect stretching compression derivative subjective value function veridical time. this view, faster pacemaking high signaling, whereas slower low levels signaling.

参考文章(28)
Gaby Maimon, John A Assad, A cognitive signal for the proactive timing of action in macaque LIP Nature Neuroscience. ,vol. 9, pp. 948- 955 ,(2006) , 10.1038/NN1716
Victoria Wochna Loerzel, Norma Conner, Advances and Challenges: Student Reflections From an Online Death and Dying Course American Journal of Hospice and Palliative Medicine. ,vol. 33, pp. 8- 15 ,(2016) , 10.1177/1049909114549182
Gustavo B.M. Mello, Sofia Soares, Joseph J. Paton, A Scalable Population Code for Time in the Striatum Current Biology. ,vol. 25, pp. 1113- 1122 ,(2015) , 10.1016/J.CUB.2015.02.036
Samuel J. Gershman, Dopamine ramps are a consequence of reward prediction errors Neural Computation. ,vol. 26, pp. 467- 471 ,(2014) , 10.1162/NECO_A_00559
C. R. Schuster, J. Zimmerman, Timing behavior during prolonged treatment with dl-amphetamine. Journal of the Experimental Analysis of Behavior. ,vol. 4, pp. 327- 330 ,(1961) , 10.1901/JEAB.1961.4-327
P. B. Dews, W. H. Morse, Some observations on an operant in human subjects and its modification by dextro amphetamine. Journal of the Experimental Analysis of Behavior. ,vol. 1, pp. 359- 364 ,(1958) , 10.1901/JEAB.1958.1-359
Brian C. Rakitin, John Gibbon, Trevor B. Penney, Chara Malapani, Sean C. Hinton, Warren H. Meck, Scalar expectancy theory and peak-interval timing in humans. Journal of Experimental Psychology: Animal Behavior Processes. ,vol. 24, pp. 15- 33 ,(1998) , 10.1037/0097-7403.24.1.15
John Gibbon, Chara Malapani, Corby L Dale, C.R. Gallistel, Toward a neurobiology of temporal cognition: advances and challenges Current Opinion in Neurobiology. ,vol. 7, pp. 170- 184 ,(1997) , 10.1016/S0959-4388(97)80005-0