Functional role of opponent, dopamine modulated D1/D2 plasticity in prediction error-driven reinforcement learning in the basal ganglia

Jitsev, Jenia; Morrison, Abigail; Abraham, Nobi; Tittgemeyer, Marc

doi:10.12751/nncn.bc2013.0164

Items
Marc 21

001			141530
005			20240313094906.0
024	7	_	\|a 10.12751/nncn.bc2013.0164 \|2 doi
037	_	_	\|a FZJ-2013-06698
041	_	_	\|a eng
100	1	_	\|a Jitsev, Jenia \|0 P:(DE-Juel1)158080 \|b 0 \|e Corresponding author \|u fzj
111	2	_	\|a Berstein Conference on Computational Neuroscience \|c Tuebingen \|d 2013-09-24 - 2013-09-27 \|w Germany
245	_	_	\|a Functional role of opponent, dopamine modulated D1/D2 plasticity in prediction error-driven reinforcement learning in the basal ganglia
260	_	_	\|c 2013 \|b G-Node
300	_	_	\|a 162 - 163
336	7	_	\|a CONFERENCE_PAPER \|2 ORCID
336	7	_	\|a Conference Paper \|0 33 \|2 EndNote
336	7	_	\|a INPROCEEDINGS \|2 BibTeX
336	7	_	\|a conferenceObject \|2 DRIVER
336	7	_	\|a Output Types/Conference Paper \|2 DataCite
336	7	_	\|a Contribution to a conference proceedings \|b contrib \|m contrib \|0 PUB:(DE-HGF)8 \|s 1568970454_27324 \|2 PUB:(DE-HGF)
520	_	_	\|a Here, we introduce a spiking actor-critic network model of learning from both reward and punishment in the basal ganglia. Both the dorsal (actor) and ventral (critic) striatum are assumed to contain populations of D1 and D2 medium spiny neurons (MSNs). In the ventral striatum, this allows separate representation of both positive and negative expected outcomes by respective D1/D2 MSN populations, which we hypothesize to reside in the shell part of the Nucleus Accumbens. The positive and negative outcome expectations are fed to dopamine (DA) neurons in VTA region, which compute and signal total prediction error by DA release. Based on recent experimental work [1], DA level is assumed to modulate plasticity of D1 and D2 synapses in opposing way, inducing LTP on D1 and LTD on D2 synapses if being high and vice versa if being low. Crucially, this form of opponent plasticity implements temporal-difference (TD)-like update of both positive and negative outcome expectations and performs appropriate adaptation of action preferences.We implemented the network in the NEST simulator [2] using leaky integrate-and-fire spiking neurons, and designed a battery of experiments in various grid world tasks. Across the tasks the network can learn both to approach the delayed rewards while consequently avoiding punishments, which posed severe difficulties for the previous model without D1/D2 segregation [3]. The model highlights thus the functional role of D1/D2 MSN segregation within the striatum in implementing appropriate TD-like learning from both reward and punishment and explains necessity for opponent direction of DA-dependent plasticity found at synapses converging on distinct striatal MSN types. The approach can be further extended to study how abnormal D1/D2 plasticity may lead to a reorganization of the basal ganglia network towards pathological, dysfunctional states, like for instance those observed in Parkinson disease under condition of progressive dopamine depletion.
536	_	_	\|a 311 - Signaling pathways, cell and tumor biology (POF2-311) \|0 G:(DE-HGF)POF2-311 \|c POF2-311 \|f POF II \|x 0
536	_	_	\|a HASB - Helmholtz Alliance on Systems Biology (HGF-SystemsBiology) \|0 G:(DE-Juel1)HGF-SystemsBiology \|c HGF-SystemsBiology \|f HASB-2008-2012 \|x 1
536	_	_	\|a SMHB - Supercomputing and Modelling for the Human Brain (HGF-SMHB-2013-2017) \|0 G:(DE-Juel1)HGF-SMHB-2013-2017 \|c HGF-SMHB-2013-2017 \|f SMHB \|x 2
536	_	_	\|a W2Morrison - W2/W3 Professorinnen Programm der Helmholtzgemeinschaft (B1175.01.12) \|0 G:(DE-HGF)B1175.01.12 \|c B1175.01.12 \|x 3
588	_	_	\|a Dataset connected to DataCite
700	1	_	\|a Abraham, Nobi \|0 P:(DE-HGF)0 \|b 1
700	1	_	\|a Tittgemeyer, Marc \|0 P:(DE-HGF)0 \|b 2
700	1	_	\|a Morrison, Abigail \|0 P:(DE-Juel1)151166 \|b 3 \|u fzj
773	_	_	\|a 10.12751/nncn.bc2013.0164
856	4	_	\|u https://portal.g-node.org/abstracts/bc13/#/doi/nncn.bc2013.0164
909	C	O	\|p VDB \|o oai:juser.fz-juelich.de:141530
910	1	_	\|a Forschungszentrum Jülich GmbH \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 0 \|6 P:(DE-Juel1)158080
910	1	_	\|a Forschungszentrum Jülich GmbH \|0 I:(DE-588b)5008462-8 \|k FZJ \|b 3 \|6 P:(DE-Juel1)151166
913	1	_	\|a DE-HGF \|b Gesundheit \|l Krebsforschung \|1 G:(DE-HGF)POF2-310 \|0 G:(DE-HGF)POF2-311 \|2 G:(DE-HGF)POF2-300 \|v Signaling pathways, cell and tumor biology \|x 0 \|4 G:(DE-HGF)POF \|3 G:(DE-HGF)POF2
914	1	_	\|y 2013
920	1	_	\|0 I:(DE-Juel1)INM-6-20090406 \|k INM-6 \|l Computational and Systems Neuroscience \|x 0
920	1	_	\|0 I:(DE-Juel1)IAS-6-20130828 \|k IAS-6 \|l Theoretical Neuroscience \|x 1
980	_	_	\|a contrib
980	_	_	\|a VDB
980	_	_	\|a I:(DE-Juel1)INM-6-20090406
980	_	_	\|a I:(DE-Juel1)IAS-6-20130828
980	_	_	\|a UNRESTRICTED
981	_	_	\|a I:(DE-Juel1)IAS-6-20130828
981	_	_	\|a I:(DE-Juel1)IAS-6-20130828

Library	Collection	CLSMajor	CLSMinor	Language	Author

Marc 21

Gast :: Anmelden JuSER
		Suchen		Absenden		Personalisieren Ihre Benachrichtigungen Ihre Körbe Ihre Suchanfragen		Hilfe