Toward a Psychology of Deep Reinforcement Learning Agents Using a Cognitive Architecture

Publication: Article, 01 Sep 2021, English. Publisher: Wiley. Journal: Topics in Cognitive Science, volume 14, pages 756-779 (issn: 1756-8757, eissn: 1756-8765)

Authors: Konstantinos Mitsopoulos; Sterling Somers; Joel Schooler; Christian Lebiere; Peter Pirolli; Robert Thomson

doi: 10.1111/tops.12573

pmid: 34467649

Abstract

We argue that cognitive models can provide a common ground between human users and deep reinforcement learning (Deep RL) algorithms for purposes of explainable artificial intelligence (AI). Casting both the human and learner as cognitive models provides common mechanisms to compare and understand their underlying decision‐making processes. This common grounding allows us to identify divergences and explain the learner's behavior in human-understandable terms. We present novel salience techniques that highlight the most relevant features in each model's decision‐making, as well as examples of this technique in common training environments such as Starcraft II and an OpenAI gridworld.

Related Organizations

Florida Institute for Human and Machine Cognition (United States)
Carnegie Mellon University (United States)
United States Military Academy (United States)

Keywords

Cognition, Artificial Intelligence, Humans, Reinforcement, Psychology, Algorithms

Impact by BIP!

- Selected citations: 11. These citations are derived from selected sources; this is an alternative to the "Influence" indicator.
- Popularity: Top 10%. Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Average. Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Top 10%. Reflects the initial momentum of an article directly after its publication, based on the underlying citation network.