Conference Presentation (After Call) FZJ-2015-00365

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
Distinct plasticity mechanisms in the basal ganglia and their functional role in reinforcement learning

 ;  ;

2014

9th European Forum of Neuroscience, FENS 2014, MilanoMilano, Italy, 5 Jul 2014 - 9 Jul 20142014-07-052014-07-09

Abstract: Aiming towards a minimal realistic circuitry model of learning from both appetitive and aversive outcomes, we implemented a spiking actor-critic network model of the basal ganglia that incorporates different plasticity mechanisms and segregates both the dorsal and ventral striatum into populations of D1 and D2 medium spiny neurons (MSNs). This segregation allows explicit, separate representation of both positive and negative expected outcome of a given environmental state by the respective population of D1 or D2 MSNs that we hypothesize to reside in the shell region of Nucleus Accumbens. Based on experimental evidence, D1 and D2 MSN populations were assumed to have distinct, opposing dopamine-modulated bidirectional synaptic plasticity.We implemented the network in the NEST simulator and performed experiments involving the application of delayed reward and punishment in a grid world setting, in which a moving agent must reach a goal state while maximizing the total reward obtained. We demonstrate that the network can learn both to approach the delayed positive reward and to consequently avoid punishment. The model highlights thus the functional role of D1/D2 MSN segregation within the striatum in implementing appropriate temporal difference(TD)-like learning from both reward and punishment and explains necessity for opponent direction of DA-dependent plasticity found at synapses converging on distinct striatal MSN types. The approach can be further extended to study how abnormal D1/D2 plasticity may lead to a reorganization of the basal ganglia network towards pathological, dysfunctional states, like for instance those observed in Parkinson disease under condition of progressive dopamine depletion.


Contributing Institute(s):
  1. Computational and Systems Neuroscience (INM-6)
  2. Theoretical Neuroscience (IAS-6)
Research Program(s):
  1. 331 - Signalling Pathways and Mechanisms in the Nervous System (POF2-331) (POF2-331)
  2. 89572 - (Dys-)function and Plasticity (POF2-89572) (POF2-89572)
  3. SMHB - Supercomputing and Modelling for the Human Brain (HGF-SMHB-2013-2017) (HGF-SMHB-2013-2017)
  4. RL-BRD-J - Neural network mechanisms of reinforcement learning (BMBF-01GQ1343) (BMBF-01GQ1343)
  5. W2Morrison - W2/W3 Professorinnen Programm der Helmholtzgemeinschaft (B1175.01.12) (B1175.01.12)

Appears in the scientific report 2014
Click to display QR Code for this record

The record appears in these collections:
Dokumenttypen > Präsentationen > Konferenzvorträge
Institutssammlungen > IAS > IAS-6
Institutssammlungen > INM > INM-6
Workflowsammlungen > Öffentliche Einträge
Publikationsdatenbank

 Datensatz erzeugt am 2015-01-12, letzte Änderung am 2024-03-13


Externer link:
Volltext herunterladen
Volltext
Dieses Dokument bewerten:

Rate this document:
1
2
3
 
(Bisher nicht rezensiert)