Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm
 
conference paper

Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm

Renard, Titouan  
•
Schlaginhaufen, Andreas  
•
Ni, Tingting  
Show more
December 2024
Proceedings of the IEEE Conference on Decision and Control
63rd IEEE Conference on Decision and Control

Given a dataset of expert demonstrations, inverse reinforcement learning (IRL) aims to recover a reward for which the expert is optimal. This work proposes a model-free algorithm to solve the entropy-regularized IRL problem. In particular, we employ a stochastic gradient descent update for the reward and a stochastic soft policy iteration update for the policy. Assuming access to a generative model, we prove that our algorithm is guaranteed to recover a reward for which the expert is -optimal using an expected number of O(1 / 2) samples of the Markov decision process (MDP). Furthermore, with an expected number of O(1 / 4) samples we prove that the optimal policy corresponding to the recovered reward is -close to the expert policy in total variation distance.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/CDC56724.2024.10886001
Scopus ID

2-s2.0-86000622747

Author(s)
Renard, Titouan  

École Polytechnique Fédérale de Lausanne

Schlaginhaufen, Andreas  

EPFL

Ni, Tingting  

EPFL

Kamgarpour, Maryam  

EPFL

Date Issued

2024-12

Publisher

Institute of Electrical and Electronics Engineers Inc.

Published in
Proceedings of the IEEE Conference on Decision and Control
DOI of the book
https://doi.org/10.1109/CDC56724.2024
ISBN of the book

9798350316339

Start page

8258

End page

8263

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
SYCAMORE  
Event nameEvent acronymEvent placeEvent date
63rd IEEE Conference on Decision and Control

CDC 2024

Milan, Italy

2024-12-16 - 2024-12-19

FunderFunding(s)Grant NumberGrant URL

Swiss Data Science Center

Swiss National Science Foundation

Available on Infoscience
March 25, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/248233
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés