Linear Bayesian Reinforcement Learning

Tziortziotis, Nikolaos; Dimitrakakis, Christos; Blekas, Konstantinos

2013

Abstract

This paper proposes a simple linear Bayesian approach to reinforcement learning. We show that, with an appropriate basis, a Bayesian linear Gaussian model is sufficient for accurately estimating the system dynamics, particularly when we allow for correlated noise. Policies are estimated by first sampling a transition model from the current posterior and then performing approximate dynamic programming on the sampled model. This form of approximate Thompson sampling results in good exploration in unknown environments. The approach can also be seen as a Bayesian generalisation of least-squares policy iteration, in which the empirical transition matrix is replaced with a sample from the posterior.
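The sample-then-plan loop described in the abstract can be illustrated with a heavily simplified sketch. This is not the paper's algorithm. It assumes a one-dimensional state, one scalar linear model per action, a known noise variance (so correlated noise is not modelled), and a one-step greedy lookahead standing in for the approximate dynamic programming step. The names `ScalarBayesLinearModel`, `thompson_action`, `run` and the toy environment weights are hypothetical.

```python
import random


class ScalarBayesLinearModel:
    """Conjugate Gaussian posterior over w in s' = w * s + noise.

    Assumption: noise ~ N(0, noise_var) with noise_var known.
    """

    def __init__(self, prior_mean=0.0, prior_var=1.0, noise_var=0.1):
        self.precision = 1.0 / prior_var              # posterior precision of w
        self.precision_mean = prior_mean / prior_var  # precision times posterior mean
        self.noise_var = noise_var

    def update(self, s, s_next):
        # Standard Bayesian linear regression update for a single observation.
        self.precision += s * s / self.noise_var
        self.precision_mean += s * s_next / self.noise_var

    @property
    def mean(self):
        return self.precision_mean / self.precision

    @property
    def var(self):
        return 1.0 / self.precision

    def sample(self, rng):
        # Draw one plausible transition weight from the current posterior.
        return rng.gauss(self.mean, self.var ** 0.5)


def thompson_action(models, s, rng):
    """Sample one model per action, then act greedily on the samples."""
    sampled = [m.sample(rng) for m in models]
    # One-step lookahead that drives the state toward zero; a stand-in
    # for the approximate dynamic programming step (assumption).
    return min(range(len(models)), key=lambda a: abs(sampled[a] * s))


def run(steps=200, seed=0):
    rng = random.Random(seed)
    true_w = [1.1, 0.5]  # hypothetical environment: one weight per action
    noise_sd = 0.1
    models = [ScalarBayesLinearModel(noise_var=noise_sd ** 2) for _ in true_w]
    s = 1.0
    for _ in range(steps):
        a = thompson_action(models, s, rng)
        s_next = true_w[a] * s + rng.gauss(0.0, noise_sd)
        models[a].update(s, s_next)  # only the chosen action's model learns
        s = s_next
    return models
```

Calling `run()` returns the per-action posteriors after interaction. Because actions are chosen from posterior samples rather than posterior means, an action whose model is still uncertain is occasionally tried, which is the exploration effect the abstract attributes to Thompson sampling.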

Details

Title Linear Bayesian Reinforcement Learning

Author(s) Tziortziotis, Nikolaos ; Dimitrakakis, Christos ; Blekas, Konstantinos

Published in IJCAI '13: Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence

Pages 1721–1728

Conference 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013)

Date 2013

Laboratories LIA

Record appears in Peer-reviewed publications; Conference Papers; Work produced at EPFL; Published

Record creation date 2014-03-11
