Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making

Xu, He; Modirshanechi, Alireza; Lehmann, Marco Philipp; Gerstner, Wulfram; Herzog, Michael

doi:10.1371/journal.pcbi.1009070

Xu, He; Modirshanechi, Alireza; Lehmann, Marco Philipp; Gerstner, Wulfram; Herzog, Michael

2021

Télécharger

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Fichiers

Résumé

Classic reinforcement learning (RL) theories cannot explain human behavior in the absence of external reward or when the environment changes. Here, we employ a deep sequential decision-making paradigm with sparse reward and abrupt environmental changes. To explain the behavior of human participants in these environments, we show that RL theories need to include surprise and novelty, each with a distinct role. While novelty drives exploration before the first encounter of a reward, surprise increases the rate of learning of a world-model as well as of model-free action-values. Even though the world-model is available for model-based RL, we find that human decisions are dominated by model-free action choices. The world-model is only marginally used for planning, but it is important to detect surprising events. Our theory predicts human action choices with high probability and allows us to dissociate surprise, novelty, and reward in EEG signals.

Détails

Titre Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making

Auteur(s) Xu, He ; Modirshanechi, Alireza ; Lehmann, Marco Philipp ; Gerstner, Wulfram ; Herzog, Michael

Publié dans PLoS Computational Biology

Volume 17

Numéro 6

Pages 1-32,e1009070

Date 2021-06-03

DOI https://doi.org/10.1371/journal.pcbi.1009070

Laboratoires LCN
LPSY

Le document apparaît dans Production scientifique et compétences > SV - Faculté des sciences de la vie > BMI - Institut des neurosciences > LCN - Laboratoire de calcul neuromimétique (IC/SV)
Production scientifique et compétences > I&C - Faculté Informatique & Communications > IINFCOM > LCN - Laboratoire de calcul neuromimétique (IC/SV)
Production scientifique et compétences > SV - Faculté des sciences de la vie > BMI - Institut des neurosciences > LPSY - Laboratoire de psychophysique
Publications validées par des pairs
Travail produit à l'EPFL
Articles de journaux
Publié

Grant FNS: CRSII2 147636
FNS: 200020 184615
H2020: 785907

Date de création de la notice 2021-06-11

Actions

Aperçu

Sélectionner le fichier :