Estimates of Parameter Distributions for Optimal Action Selection

Dimitrakakis, Christos; Bengio, Samy

2004

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

We present a general method for maintaining estimates of the distribution of parameters in arbitrary models. This is then applied to the estimation of probability distribution over actions in value-based reinforcement learning. While this approach is similar to other techniques that maintain a confidence measure for action-values, it nevertheless offers a new insight into current techniques and reveals potential avenues of further research.

Details

Title Estimates of Parameter Distributions for Optimal Action Selection

Author(s) Dimitrakakis, Christos ; Bengio, Samy

Date 2004

Publisher IDIAP

Keywords

learning

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2006-03-10

Abstract

Details

Actions