Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Robust Data-Driven Dynamic Programming
 
conference paper

Robust Data-Driven Dynamic Programming

Hanasusanto, Grani A.
•
Kuhn, Daniel  
Burges, C. J. C.
•
Bottou, L.
Show more
2013
NIPS Proceedings 26
Neural Information Processing Systems

In stochastic optimal control the distribution of the exogenous noise is typically unknown and must be inferred from limited data before dynamic programming (DP)-based solution schemes can be applied. If the conditional expectations in the DP recursions are estimated via kernel regression, however, the historical sample paths enter the solution procedure directly as they determine the evaluation points of the cost-to-go functions. The resulting data-driven DP scheme is asymptotically consistent and admits efficient computational solution when combined with parametric value function approximations. If training data is sparse, however, the estimated cost-to-go functions display a high variability and an optimistic bias, while the corresponding control policies perform poorly in out-of-sample tests. To mitigate these small sample effects, we propose a robust data-driven DP scheme, which replaces the expectations in the DP recursions with worst-case expectations over a set of distributions close to the best estimate. We show that the arising min-max problems in the DP recursions reduce to tractable conic programs. We also demonstrate that this robust algorithm dominates state-of-the-art benchmark algorithms in out-of-sample tests across several application domains.

  • Details
  • Metrics
Type
conference paper
Author(s)
Hanasusanto, Grani A.
Kuhn, Daniel  
Editors
Burges, C. J. C.
•
Bottou, L.
•
Welling, M.
•
Ghahramani, Z.
•
Weinberger, K. Q.
Date Issued

2013

Published in
NIPS Proceedings 26
URL
http://papers.nips.cc/paper/5123-robust-data-driven-dynamic-programming
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
RAO  
Event nameEvent placeEvent date
Neural Information Processing Systems

Lake Tahoe, USA

December 2013

Available on Infoscience
January 29, 2014
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/100210
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés