Representation learning with reward prediction errors

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 26 Jul 2022Embargo end date: 01 Jan 2021 English Publisher:The Neurons Behavior Data Analysis and Theory collectiveJournal:Neurons, Behavior, Data analysis, and Theory, volume 1 (eissn: 2690-2664,

Copyright policy )

Authors: Alexander, William H.; Gershman, Samuel J.;

doi: 10.51628/001c.37270 , 10.48550/arxiv.2108.12402

arXiv: 2108.12402

Representation learning with reward prediction errors

- Summary
- Subjects
- Metrics

Abstract

The Reward Prediction Error hypothesis proposes that phasic activity in the midbrain dopaminergic system reflects prediction errors needed for learning in reinforcement learning. Besides the well-documented association between dopamine and reward processing, dopamine is implicated in a variety of functions without a clear relationship to reward prediction error. Fluctuations in dopamine levels influence the subjective perception of time, dopamine bursts precede the generation of motor responses, and the dopaminergic system innervates regions of the brain, including hippocampus and areas in prefrontal cortex, whose function is not uniquely tied to reward. In this manuscript, we propose that a common theme linking these functions is representation, and that prediction errors signaled by the dopamine system, in addition to driving associative learning, can also support the acquisition of adaptive state representations. In a series of simulations, we show how this extension can account for the role of dopamine in temporal and spatial representation, motor response, and abstract categorization tasks. By extending the role of dopamine signals to learning state representations, we resolve a critical challenge to the Reward Prediction Error hypothesis of dopamine function.

Related Organizations

Florida Atlantic University
United States
Harvard University
United States

Keywords

Quantitative Biology - Neurons and Cognition, FOS: Biological sciences, Neurons and Cognition (q-bio.NC)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Green

gold