Multi-Task Reinforcement Learning in Humans

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type , Preprint 22 Oct 2019 United States Publisher:openRxivJournal:Nature Human Behaviour, volume 5, pages 764-773 (eissn: 2397-3374,

Copyright policy )Funded by:NSF | A Center for Brains, Mind...

Authors: Momchil S. Tomov; Eric Schulz; Samuel J. Gershman;

doi: 10.1101/815332 , 10.1038/s41562-020-01035-y

pmid: 33510391

handle: 21.11116/0000-0005-D552-E

Multi-Task Reinforcement Learning in Humans

- Summary
- Subjects
- Metrics

Abstract

ABSTRACT The ability to transfer knowledge across tasks and generalize to novel ones is an important hallmark of human intelligence. Yet not much is known about human multi-task reinforcement learning. We study participants’ behavior in a novel two-step decision making task with multiple features and changing reward functions. We compare their behavior to two state-of-the-art algorithms for multi-task reinforcement learning, one that maps previous policies and encountered features to new reward functions and one that approximates value functions across tasks, as well as to standard model-based and model-free algorithms. Across three exploratory experiments and a large preregistered experiment, our results provide strong evidence for a strategy that maps previously learned policies to novel scenarios. These results enrich our understanding of human reinforcement learning in complex environments with changing task demands.

Country

United States

Related Organizations

Max Planck Society
Germany
Massachusetts Institute of Technology
United States
Harvard Medical School
United States
Max Planck Institute for Biological Cybernetics
Germany
Department of Psychology Harvard University
United States

View all View all

Keywords

Adult, Male, Transfer, Psychology, Decision Making, 150, Middle Aged, Models, Theoretical, 004, Young Adult, Humans, Learning, Female, Reinforcement, Psychology

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	48
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%