Infoscience
conference paper not in proceedings

Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch

Viano, Luca
•
Huang, Yu-Ting
•
Parameswaran, Kamalaruban  
et al.
2021
35th Conference on Neural Information Processing Systems (NeurIPS 2021)

We study the inverse reinforcement learning (IRL) problem under a transition dynamics mismatch between the expert and the learner. Specifically, we consider the Maximum Causal Entropy (MCE) IRL learner model and provide a tight upper bound on the learner's performance degradation based on the ℓ1-distance between the transition dynamics of the expert and the learner. Leveraging insights from the Robust RL literature, we propose a robust MCE IRL algorithm, a principled approach to handling this mismatch. Finally, we empirically demonstrate the stable performance of our algorithm compared to the standard MCE IRL algorithm under transition dynamics mismatches in both finite and continuous MDP problems.
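The MCE IRL learner the abstract refers to fits a reward so that the learner's soft-optimal visitation frequencies, computed under its *own* dynamics, match the expert's. A minimal tabular sketch of that loop (the toy two-state MDP, slip probability, and all function names here are illustrative assumptions, not the paper's implementation or its robust variant):

```python
import numpy as np

def soft_value_iteration(P, r, gamma, iters=200):
    # MCE soft Bellman backup: V(s) = log sum_a exp(Q(s,a)), computed stably.
    S, A, _ = P.shape
    V = np.zeros(S)
    for _ in range(iters):
        Q = r[:, None] + gamma * P @ V                      # (S, A)
        Qmax = Q.max(axis=1, keepdims=True)
        V = (Qmax + np.log(np.exp(Q - Qmax).sum(axis=1, keepdims=True))).ravel()
    return np.exp(Q - V[:, None])                           # soft-optimal policy pi(a|s)

def visitation(P, pi, rho0, gamma, iters=200):
    # Discounted state visitation frequencies under policy pi.
    d, total = rho0.copy(), np.zeros(P.shape[0])
    for _ in range(iters):
        total += d
        d = gamma * np.einsum('s,sa,sat->t', d, pi, P)      # one-step pushforward
    return total

def mce_irl(P_learner, mu_expert, rho0, gamma=0.9, lr=0.05, steps=100):
    # Gradient ascent on reward weights w; with one-hot state features the
    # gradient is expert visitation minus learner visitation.
    w = np.zeros(P_learner.shape[0])
    for _ in range(steps):
        pi = soft_value_iteration(P_learner, w, gamma)
        mu = visitation(P_learner, pi, rho0, gamma)
        w += lr * (mu_expert - mu)
    return w, pi

# Hypothetical 2-state, 2-action MDP (action 0: stay, action 1: switch).
S, A = 2, 2
P_exp = np.zeros((S, A, S))
for s in range(S):
    P_exp[s, 0, s] = 1.0           # expert dynamics: deterministic
    P_exp[s, 1, 1 - s] = 1.0
# Learner dynamics mismatch: the switch action slips with probability 0.2.
P_lrn = P_exp.copy()
for s in range(S):
    P_lrn[s, 1, 1 - s] = 0.8
    P_lrn[s, 1, s] = 0.2

rho0 = np.array([1.0, 0.0])
r_true = np.array([0.0, 1.0])      # expert is rewarded in state 1
pi_exp = soft_value_iteration(P_exp, r_true, 0.9)
mu_exp = visitation(P_exp, pi_exp, rho0, 0.9)
w, _ = mce_irl(P_lrn, mu_exp, rho0)
```

Even under the mismatched dynamics, the recovered weights rank state 1 above state 0; the paper's contribution is bounding and mitigating the performance loss this mismatch induces, which this plain MCE IRL sketch does not address.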

Name: robust_inverse_reinforcement_l-Supplementary Material1.pdf
Type: Postprint
Version: Accepted version
Access type: Open access
License condition: Copyright
Size: 1.61 MB
Format: Adobe PDF
Checksum (MD5): b948502fbb23b56dba3dfc3b7b4b5b36

Contact: infoscience@epfl.ch

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved.