A New Framework for Cortico-Striatal Plasticity: Behavioural Theory Meets In Vitro Data at the Reinforcement-Action Interface

作者: Kevin N. Gurney , Mark D. Humphries , Peter Redgrave

DOI: 10.1371/JOURNAL.PBIO.1002034

关键词: ReinforcementNeuroplasticityExtinction (psychology)NeuroscienceSynapseAction (philosophy)Action selectionBiologyReinforcement learningSynaptic plasticity

摘要: Operant learning requires that reinforcement signals interact with action representations at a suitable neural interface. Much evidence suggests this occurs when phasic dopamine, acting as prediction error, gates plasticity cortico-striatal synapses, and thereby changes the future likelihood of selecting action(s) coded by striatal neurons. But hypothesis faces serious challenges. First, is inexplicably complex, depending on spike timing, dopamine level, receptor type. Second, there credit assignment problem—action selection occur long before consequent signal. Third, two types output neuron have apparently opposite effects selection. Whether these factors rule out interface how they to produce unknown. We present computational framework addresses first predict expected activity over an operant task for both action-coding neuron, show co-operate promote in compete suppression extinction. Separately, we derive complete model spike-timing dependent from vitro data. then produces predicted necessary extinction task, remarkable convergence bottom-up data-driven top-down behavioural requirements theory. Moreover, complex dependencies are not only sufficient but Validating model, it can account data describing extinction, renewal, reacquisition, replicate experimental plasticity. By bridging levels between single synapse behaviour, our shows striatum acts action-reinforcement

参考文章(116)
Kevin Gurney, Nathan Lepora, Ashvin Shah, Ansgar Koene, Peter Redgrave, Action Discovery and Intrinsic Motivation: A Biologically Constrained Formalisation Intrinsically Motivated Learning in Natural and Artificial Systems. pp. 151- 181 ,(2013) , 10.1007/978-3-642-32375-1_7
James C Houk, James L Adams, Andrew G Barto, A Model of How the Basal Ganglia Generate and Use Neural Signals That Predict Reinforcement MIT Press. pp. 249- 270 ,(1994)
L. F. Abbott, Sacha B. Nelson, Synaptic plasticity: taming the beast. Nature Neuroscience. ,vol. 3, pp. 1178- 1183 ,(2000) , 10.1038/81453
Chengke Tang, Anthony P. Pawlak, Volodymyr Prokopenko, Mark O. West, Changes in activity of the striatum during formation of a motor habit European Journal of Neuroscience. ,vol. 25, pp. 1212- 1227 ,(2007) , 10.1111/J.1460-9568.2007.05353.X
Eugene M. Izhikevich, Dynamical Systems in Neuroscience ,(2006)
Leon N Cooper, Nathan Intrator, Harel Z Shouval, Brian S Blais, Theory of Cortical Plasticity ,(2004)
PR Montague, P Dayan, TJ Sejnowski, A framework for mesencephalic dopamine systems based on predictive Hebbian learning The Journal of Neuroscience. ,vol. 16, pp. 1936- 1947 ,(1996) , 10.1523/JNEUROSCI.16-05-01936.1996
Anton Ilango, Andrew J. Kesner, Kristine L. Keller, Garret D. Stuber, Antonello Bonci, Satoshi Ikemoto, Similar roles of substantia nigra and ventral tegmental dopamine neurons in reward and aversion. The Journal of Neuroscience. ,vol. 34, pp. 817- 822 ,(2014) , 10.1523/JNEUROSCI.1703-13.2014