Infoscience

master thesis

Provable Convergence Guarantees for Constrained Inverse Reinforcement Learning

Renard, Titouan  
June 14, 2023

By incorporating known constraints into the inverse reinforcement learning (IRL) framework, constrained inverse reinforcement learning (CIRL) can learn behaviors from expert demonstration while satisfying a set of pre-defined constraints. This makes CIRL relevant in safety-critical domains, as it provides a direct way to devise AI systems that enforce safety requirements. This master thesis proposes and analyzes an algorithm, termed NPG-CIRL, that solves the problem of CIRL. Our algorithm implements a primal-dual scheme that extends the natural policy gradient (NPG) algorithm to the CIRL setting. We provide a finite-time analysis of the algorithm’s global convergence in the idealized exact gradient setting and the more practical stochastic gradient setting. We show that the algorithm requires $O(1/ε^2)$ gradient evaluations to reach an ε-approximate solution and to satisfy the imposed constraints. Our analysis also quantifies the sample complexity, showing that the algorithm requires $O(1/ε^4)$ samples to achieve convergence when using Monte Carlo gradient estimation techniques.
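
To make the phrase "primal-dual scheme that extends the natural policy gradient" more concrete, the following is a minimal sketch of a generic primal-dual NPG loop on a toy constrained MDP. It is not the thesis's NPG-CIRL algorithm: the reward is assumed known (the IRL reward-learning step is omitted), gradients are exact rather than Monte Carlo estimates, and the toy dynamics, step sizes, budget, and all function names (`q_values`, `value`, `primal_dual_npg`) are hypothetical choices made for illustration only.

```python
import math

GAMMA = 0.9  # discount factor (assumed for this toy example)

def q_values(policy, P, payoff, iters=500):
    """Evaluate Q^pi for a payoff table by fixed-point iteration."""
    n_s, n_a = len(payoff), len(payoff[0])
    q = [[0.0] * n_a for _ in range(n_s)]
    for _ in range(iters):
        v = [sum(policy[s][a] * q[s][a] for a in range(n_a)) for s in range(n_s)]
        q = [[payoff[s][a] + GAMMA * sum(P[s][a][t] * v[t] for t in range(n_s))
              for a in range(n_a)] for s in range(n_s)]
    return q

def value(policy, q, mu):
    """Expected discounted payoff under the initial state distribution mu."""
    return sum(mu[s] * sum(p * x for p, x in zip(policy[s], q[s]))
               for s in range(len(mu)))

def primal_dual_npg(P, R, C, mu, budget, steps=200, eta=0.1, eta_dual=0.05):
    """Maximize reward value subject to cost value <= budget (generic sketch)."""
    n_s, n_a = len(R), len(R[0])
    policy = [[1.0 / n_a] * n_a for _ in range(n_s)]  # uniform initial policy
    lam, cost_sum = 0.0, 0.0
    for _ in range(steps):
        # Primal step: tabular softmax NPG on the Lagrangian payoff r - lam * c.
        # This is a multiplicative-weights update; 1/(1 - GAMMA) is folded into eta.
        lagr = [[R[s][a] - lam * C[s][a] for a in range(n_a)] for s in range(n_s)]
        q = q_values(policy, P, lagr)
        for s in range(n_s):
            top = max(q[s])  # subtract the max for numerical stability
            w = [policy[s][a] * math.exp(eta * (q[s][a] - top)) for a in range(n_a)]
            total = sum(w)
            policy[s] = [x / total for x in w]
        # Dual step: raise lam when the constraint is violated, project onto lam >= 0.
        v_c = value(policy, q_values(policy, P, C), mu)
        lam = max(0.0, lam + eta_dual * (v_c - budget))
        cost_sum += v_c
    return policy, lam, cost_sum / steps

# Hypothetical toy MDP: 2 states, 2 actions; action a moves to state a.
P = [[[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]]]
R = [[0.0, 1.0], [0.0, 1.0]]  # action 1 earns reward 1
C = [[0.0, 1.0], [0.0, 1.0]]  # action 1 also incurs cost 1
mu = [0.5, 0.5]
policy, lam, avg_cost = primal_dual_npg(P, R, C, mu, budget=5.0)
```

In a scheme of this kind the policy and the multiplier can oscillate around the constraint boundary, which is why the sketch returns the cost value averaged over iterations alongside the final policy. Replacing the exact `q_values` call with sampled rollouts would move the sketch toward the stochastic gradient setting mentioned in the abstract.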

Name: Renard2023.pdf
Type: N/a
Access type: openaccess
License Condition: copyright
Size: 4.02 MB
Format: Adobe PDF
Checksum (MD5): 69be13014a0e33286d8f8cf2c3f52f42

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved