Abstract

We consider an agent that must choose repeatedly among several actions. Each action has a certain probability of giving the agent an energy reward, and costs may be associated with switching between actions. The agent does not know which action has the highest reward probability, and the probabilities change randomly over time. We study two learning rules, one deterministic and one stochastic, that have been widely used to model decision-making processes in animals. In particular, we examine the influence of the rules' "learning rate" on the agent's energy gain. We compare the performance of each rule with the best performance attainable when the agent has either full knowledge or no knowledge of the environment. Over relatively short periods of time, both rules successfully enable agents to exploit their environment. Moreover, under a range of effective learning rates the two rules are equivalent, and can be expressed by a third rule that requires the agent to select the action for which the current run of unsuccessful trials is shortest. However, the performance of both rules is relatively poor over longer periods of time, and under most circumstances no better than the performance an agent could achieve without any knowledge of the environment. We propose a simple extension to the original rules that enables agents to learn about and effectively exploit a changing environment for an unlimited period of time.
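
The "third rule" mentioned above has a direct algorithmic reading. The sketch below is a minimal illustration, not the paper's actual model: it assumes a two-armed bandit whose reward probabilities are occasionally swapped and a fixed switching cost, and all parameter names (p_change, switch_cost, n_trials) are hypothetical choices for the example.

```python
import random


def simulate(n_trials=10_000, p_change=0.01, reward_probs=(0.8, 0.2),
             switch_cost=0.1, seed=0):
    """Run the failure-run rule on a two-armed bandit whose reward
    probabilities change randomly over time (illustrative assumptions)."""
    rng = random.Random(seed)
    probs = list(reward_probs)
    failure_run = [0, 0]   # current run of unsuccessful trials per action
    last_action = None
    energy = 0.0
    for _ in range(n_trials):
        # Environment change: with probability p_change, swap the two
        # reward probabilities (one way to model random change over time).
        if rng.random() < p_change:
            probs.reverse()
        # Rule: choose the action whose current run of failures is
        # shortest; break ties by repeating the previous action so that
        # no switching cost is incurred unnecessarily.
        if failure_run[0] != failure_run[1]:
            action = min((0, 1), key=lambda a: failure_run[a])
        else:
            action = last_action if last_action is not None else rng.randrange(2)
        if last_action is not None and action != last_action:
            energy -= switch_cost          # cost of switching between actions
        if rng.random() < probs[action]:   # successful trial: energy reward
            energy += 1.0
            failure_run[action] = 0        # success resets this action's run
        else:
            failure_run[action] += 1       # failure extends this action's run
        last_action = action
    return energy


if __name__ == "__main__":
    print(f"total energy gained: {simulate():.1f}")
```

A success resets the chosen action's failure run to zero while a failure extends it, so the rule behaves like a win-stay strategy that tolerates a short streak of bad luck before switching; the unchosen action's run is left unchanged until it is sampled again.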
