Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Proximal Point Imitation Learning
 
conference paper

Proximal Point Imitation Learning

Viano, Luca  
•
Kamoutsi, Angeliki
•
Neu, Gergely
Show more
2022
[Proceedings of NEURIPS 2022]
36th Conference on Neural Information Processing Systems (NeurIPS)

This work develops new algorithms with rigorous efficiency guarantees for infinite horizon imitation learning (IL) with linear function approximation without restrictive coherence assumptions. We begin with the minimax formulation of the problem and then outline how to leverage classical tools from optimization, in particular, the proximal-point method (PPM) and dual smoothing, for online and offline IL, respectively. Thanks to PPM, we avoid nested policy evaluation and cost updates for online IL appearing in the prior literature. In particular, we do away with the conventional alternating updates by the optimization of a single convex and smooth objective over both cost and Q-functions. When solved inexactly, we relate the optimization errors to the suboptimality of the recovered policy. As an added bonus, by re-interpreting PPM as dual smoothing with the expert policy as a center point, we also obtain an offline IL algorithm enjoying theoretical guarantees in terms of required expert trajectories. Finally, we achieve convincing empirical performance for both linear and neural network function approximation.

  • Files
  • Details
  • Metrics
Type
conference paper
Author(s)
Viano, Luca  
Kamoutsi, Angeliki
Neu, Gergely
Krawczuk, Igor  
Cevher, Volkan  orcid-logo
Date Issued

2022

Published in
[Proceedings of NEURIPS 2022]
Total of pages

54

Subjects

ml-ai

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIONS  
Event nameEvent placeEvent date
36th Conference on Neural Information Processing Systems (NeurIPS)

New Orleans, USA

November 28 - December 3, 2022

Available on Infoscience
October 4, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/191184
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés