Monte-Carlo utility estimates for Bayesian reinforcement learning

Dimitrakakis, Christos

2013

Abstract

This paper introduces a set of algorithms for Monte-Carlo Bayesian reinforcement learning. First, Monte-Carlo estimation of upper bounds on the Bayes-optimal value function is employed to construct an optimistic policy. Second, gradient-based algorithms for approximate upper and lower bounds are introduced. Finally, we introduce a new class of gradient algorithms for Bayesian Bellman error minimisation. We show theoretically that the gradient methods are sound. Experimentally, we demonstrate the superiority of the upper bound method in terms of reward obtained. However, we also show that the Bayesian Bellman error method is a close second, despite being computationally much simpler.
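
The upper bound mentioned in the abstract rests on a general fact: the Bayes-optimal value under a belief cannot exceed the expected optimal value of an MDP drawn from that belief, because the expectation of a maximum is at least the maximum of an expectation. The following is a minimal sketch of estimating that bound by sampling, not the paper's algorithm. It assumes a small discrete MDP with known rewards and independent Dirichlet posteriors over transitions; the names `sample_transitions`, `value_iteration` and `mc_upper_bound` are hypothetical and chosen only for illustration.

```python
import random


def sample_transitions(counts, rng):
    """Draw one transition model from independent Dirichlet posteriors.

    counts[s][a] is the list of Dirichlet parameters over next states
    (an assumed belief representation, not taken from the paper).
    """
    model = []
    for state_counts in counts:
        row = []
        for alpha in state_counts:
            # Normalised Gamma draws give a Dirichlet sample.
            draws = [rng.gammavariate(a, 1.0) for a in alpha]
            total = sum(draws)
            row.append([d / total for d in draws])
        model.append(row)
    return model


def value_iteration(P, R, gamma=0.95, tol=1e-8):
    """Optimal state values of a single, fully known MDP."""
    n_states = len(P)
    V = [0.0] * n_states
    while True:
        V_new = [
            max(
                R[s][a] + gamma * sum(p * v for p, v in zip(P[s][a], V))
                for a in range(len(P[s]))
            )
            for s in range(n_states)
        ]
        if max(abs(x - y) for x, y in zip(V_new, V)) < tol:
            return V_new
        V = V_new


def mc_upper_bound(counts, R, n_samples=100, gamma=0.95, seed=0):
    """Monte-Carlo estimate of E[V*_mu(s)] over MDPs mu drawn from the belief.

    The exact expectation upper-bounds the Bayes-optimal value; the sample
    average returned here is only a noisy estimate of that bound.
    """
    rng = random.Random(seed)
    n_states = len(counts)
    totals = [0.0] * n_states
    for _ in range(n_samples):
        P = sample_transitions(counts, rng)
        V = value_iteration(P, R, gamma)
        for s in range(n_states):
            totals[s] += V[s]
    return [t / n_samples for t in totals]
```

For instance, with two states and two actions one might call `mc_upper_bound([[[1.0, 1.0], [2.0, 1.0]], [[1.0, 3.0], [1.0, 1.0]]], [[0.0, 0.5], [1.0, 0.2]], n_samples=50)`; the counts and rewards here are made up. More samples reduce the variance of the estimate but do not tighten the bound itself.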

Details

Title: Monte-Carlo utility estimates for Bayesian reinforcement learning

Author(s): Dimitrakakis, Christos

Date: 2013

Laboratories: LIA

The document appears in: Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LIA - Artificial Intelligence Laboratory
Work produced at EPFL
Technical reports

Record creation date: 2013-03-12
