Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Reports, Documentation, and Standards
  4. Gradient estimates of return
 
report

Gradient estimates of return

Dimitrakakis, Christos
•
Bengio, Samy  
2005

The exploration-exploitation trade-off that arises when one considers simple point estimates of expected returns no longer appears when full distributions are considered. This work develops a simple gradient-based approach for mainting such distributions and investigates methods for using them to direct exploration.

  • Files
  • Details
  • Metrics
Type
report
Author(s)
Dimitrakakis, Christos
Bengio, Samy  
Date Issued

2005

Publisher

IDIAP

Subjects

learning

Note

Published in PASCAL Workshop in Principled Methods of Trading Exploration and Exploitation, London, UK, 2005

URL

URL

http://publications.idiap.ch/downloads/reports/2005/dimitrakakis-idiap-rr-05-29.pdf
Written at

EPFL

EPFL units
LIDIAP  
Available on Infoscience
March 10, 2006
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/228698
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés