Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Preprint . 2026
License: CC BY
Data sources: ZENODO
ZENODO
Preprint . 2026
License: CC BY
Data sources: Datacite
ZENODO
Preprint . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Conditionally Stable Motivation: A Neurocomputational Model of Long-Horizon Goal Persistence under Extreme Reward Sparsity

Authors: Sulin, Zhang;

Conditionally Stable Motivation: A Neurocomputational Model of Long-Horizon Goal Persistence under Extreme Reward Sparsity

Abstract

Long-horizon human pursuits (e.g., lifelong projects, scientific careers, entrepreneurship, elite training) sometimes show a striking pattern: persistence with low or zero immediate reward for extended periods, punctuated by sudden, intense bursts of goal-directed activity when opportunities arise. Classical reinforcement learning (RL) and canonical motivational theories — which emphasize reward prediction errors, temporal discounting, and reward-driven value updating — struggle to capture this “enduring yet conditional” persistence. Here we formalize and defend a neurocomputational hypothesis we call Conditionally Stable Motivation (CSM). CSM posits that (i) certain high-order goals are encoded as stable latent value attractors (state components, not ephemeral reward signals), (ii) a latent opportunity set mediates whether the goal’s motivational potential remains active, and (iii) neural circuits implement a two-mode control policy (maintenance vs. exploitation) governed by the opportunity set, with dopaminergic signals acting primarily as opportunity indicators rather than pure reward prediction errors in this regime. We present a precise mathematical model (state augmentation, motivational potential, policy switching rules), map model components to plausible neural substrates (vmPFC, dlPFC, ACC, ventral striatum/VTA), derive empirical predictions, and outline experiments and simulation paradigms for validation. We argue that CSM (a) reconciles long-horizon persistence with sparse rewards, (b) makes falsifiable neurophysiological predictions distinct from standard RL, and (c) provides a framework for understanding both adaptive persistence and pathological forms of unyielding pursuit. Keywords: long-horizon goals, motivation, vmPFC, dopamine, reinforcement learning, opportunity space, hierarchical control, theoretical neuroscience

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average