Neural systems for choice and valuation with counterfactual learning signals

M J Tobia; R Guo; U Schwarze; W Boehmer; J Gläscher; B Finckh; A Marschner; C Büchel; K Obermayer; Tobias Sommer-Blöchl

doi:10.1016/j.neuroimage.2013.11.051

Neural systems for choice and valuation with counterfactual learning signals

Standard

Neural systems for choice and valuation with counterfactual learning signals. / Tobia, M J; Guo, R; Schwarze, U; Boehmer, W; Gläscher, J; Finckh, B; Marschner, A; Büchel, C; Obermayer, K; Sommer-Blöchl, Tobias.

in: NEUROIMAGE, Jahrgang 89, 01.04.2014, S. 57-69.

Publikationen: SCORING: Beitrag in Fachzeitschrift/Zeitung › SCORING: Zeitschriftenaufsatz › Forschung › Begutachtung

Harvard

Tobia, MJ, Guo, R, Schwarze, U, Boehmer, W, Gläscher, J, Finckh, B, Marschner, A, Büchel, C, Obermayer, K & Sommer-Blöchl, T 2014, 'Neural systems for choice and valuation with counterfactual learning signals', NEUROIMAGE, Jg. 89, S. 57-69. https://doi.org/10.1016/j.neuroimage.2013.11.051

APA

Tobia, M. J., Guo, R., Schwarze, U., Boehmer, W., Gläscher, J., Finckh, B., Marschner, A., Büchel, C., Obermayer, K., & Sommer-Blöchl, T. (2014). Neural systems for choice and valuation with counterfactual learning signals. NEUROIMAGE, 89, 57-69. https://doi.org/10.1016/j.neuroimage.2013.11.051

Vancouver

Tobia MJ, Guo R, Schwarze U, Boehmer W, Gläscher J, Finckh B et al. Neural systems for choice and valuation with counterfactual learning signals. NEUROIMAGE. 2014 Apr 1;89:57-69. https://doi.org/10.1016/j.neuroimage.2013.11.051

Bibtex

@article{76c4a6e9b968457eb49bfee9c9884c90,

title = "Neural systems for choice and valuation with counterfactual learning signals",

abstract = "The purpose of this experiment was to test a computational model of reinforcement learning with and without fictive prediction error (FPE) signals to investigate how counterfactual consequences contribute to acquired representations of action-specific expected value, and to determine the functional neuroanatomy and neuromodulator systems that are involved. 80 male participants underwent dietary depletion of either tryptophan or tyrosine/phenylalanine to manipulate serotonin (5HT) and dopamine (DA), respectively. They completed 80 rounds (240 trials) of a strategic sequential investment task that required accepting interim losses in order to access a lucrative state and maximize long-term gains, while being scanned. We extended the standard Q-learning model by incorporating both counterfactual gains and losses into separate error signals. The FPE model explained the participants' data significantly better than a model that did not include counterfactual learning signals. Expected value from the FPE model was significantly correlated with BOLD signal change in the ventromedial prefrontal cortex (vmPFC) and posterior orbitofrontal cortex (OFC), whereas expected value from the standard model did not predict changes in neural activity. The depletion procedure revealed significantly different neural responses to expected value in the vmPFC, caudate, and dopaminergic midbrain in the vicinity of the substantia nigra (SN). Differences in neural activity were not evident in the standard Q-learning computational model. These findings demonstrate that FPE signals are an important component of valuation for decision making, and that the neural representation of expected value incorporates cortical and subcortical structures via interactions among serotonergic and dopaminergic modulator systems.",

author = "Tobia, {M J} and R Guo and U Schwarze and W Boehmer and J Gl{\"a}scher and B Finckh and A Marschner and C B{\"u}chel and K Obermayer and Tobias Sommer-Bl{\"o}chl",

year = "2014",

month = apr,

day = "1",

doi = "10.1016/j.neuroimage.2013.11.051",

language = "English",

volume = "89",

pages = "57--69",

journal = "NEUROIMAGE",

issn = "1053-8119",

publisher = "Academic Press",

}

RIS

TY - JOUR

T1 - Neural systems for choice and valuation with counterfactual learning signals

AU - Tobia, M J

AU - Guo, R

AU - Schwarze, U

AU - Boehmer, W

AU - Gläscher, J

AU - Finckh, B

AU - Marschner, A

AU - Büchel, C

AU - Obermayer, K

AU - Sommer-Blöchl, Tobias

PY - 2014/4/1

Y1 - 2014/4/1

N2 - The purpose of this experiment was to test a computational model of reinforcement learning with and without fictive prediction error (FPE) signals to investigate how counterfactual consequences contribute to acquired representations of action-specific expected value, and to determine the functional neuroanatomy and neuromodulator systems that are involved. 80 male participants underwent dietary depletion of either tryptophan or tyrosine/phenylalanine to manipulate serotonin (5HT) and dopamine (DA), respectively. They completed 80 rounds (240 trials) of a strategic sequential investment task that required accepting interim losses in order to access a lucrative state and maximize long-term gains, while being scanned. We extended the standard Q-learning model by incorporating both counterfactual gains and losses into separate error signals. The FPE model explained the participants' data significantly better than a model that did not include counterfactual learning signals. Expected value from the FPE model was significantly correlated with BOLD signal change in the ventromedial prefrontal cortex (vmPFC) and posterior orbitofrontal cortex (OFC), whereas expected value from the standard model did not predict changes in neural activity. The depletion procedure revealed significantly different neural responses to expected value in the vmPFC, caudate, and dopaminergic midbrain in the vicinity of the substantia nigra (SN). Differences in neural activity were not evident in the standard Q-learning computational model. These findings demonstrate that FPE signals are an important component of valuation for decision making, and that the neural representation of expected value incorporates cortical and subcortical structures via interactions among serotonergic and dopaminergic modulator systems.

AB - The purpose of this experiment was to test a computational model of reinforcement learning with and without fictive prediction error (FPE) signals to investigate how counterfactual consequences contribute to acquired representations of action-specific expected value, and to determine the functional neuroanatomy and neuromodulator systems that are involved. 80 male participants underwent dietary depletion of either tryptophan or tyrosine/phenylalanine to manipulate serotonin (5HT) and dopamine (DA), respectively. They completed 80 rounds (240 trials) of a strategic sequential investment task that required accepting interim losses in order to access a lucrative state and maximize long-term gains, while being scanned. We extended the standard Q-learning model by incorporating both counterfactual gains and losses into separate error signals. The FPE model explained the participants' data significantly better than a model that did not include counterfactual learning signals. Expected value from the FPE model was significantly correlated with BOLD signal change in the ventromedial prefrontal cortex (vmPFC) and posterior orbitofrontal cortex (OFC), whereas expected value from the standard model did not predict changes in neural activity. The depletion procedure revealed significantly different neural responses to expected value in the vmPFC, caudate, and dopaminergic midbrain in the vicinity of the substantia nigra (SN). Differences in neural activity were not evident in the standard Q-learning computational model. These findings demonstrate that FPE signals are an important component of valuation for decision making, and that the neural representation of expected value incorporates cortical and subcortical structures via interactions among serotonergic and dopaminergic modulator systems.

U2 - 10.1016/j.neuroimage.2013.11.051

DO - 10.1016/j.neuroimage.2013.11.051

M3 - SCORING: Journal article

C2 - 24321554

VL - 89

SP - 57

EP - 69

JO - NEUROIMAGE

JF - NEUROIMAGE

SN - 1053-8119

ER -