Active Improvement of Control Policies with Bayesian Gaussian Mixture Model

Girgin, Hakan; Pignat, E.; Jaquier, N.; Calinon, S.

doi:10.1109/IROS45743.2020.9341187

conference paper

Active Improvement of Control Policies with Bayesian Gaussian Mixture Model

Girgin, Hakan

•

Pignat, E.

•

Jaquier, N.

2020

Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems

IEEE/RSJ Intl Conf. on Intelligent Robots and Systems

Learning from demonstration (LfD) is an intuitive framework allowing non-expert users to easily (re-)program robots. However, the quality and quantity of demonstrations have a great influence on the generalization performances of LfD approaches. In this paper, we introduce a novel active learning framework in order to improve the generalization capabilities of control policies. The proposed approach is based on the epistemic uncertainties of Bayesian Gaussian mixture models (BGMMs). We determine the new query point location by optimizing a closed-form information-density cost based on the quadratic Rényi entropy. Furthermore, to better represent uncertain regions and to avoid local optima problem, we propose to approximate the active learning cost with a Gaussian mixture model (GMM). We demonstrate our active learning framework in the context of a reaching task in a cluttered environment with an illustrative toy example and a real experiment with a Panda robot.

Type

conference paper

DOI

10.1109/IROS45743.2020.9341187

Author(s)

Girgin, Hakan

Pignat, E.

Jaquier, N.

Calinon, S.

Date Issued

2020

Publisher

IEEE

Published in

Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems

Subjects

active learning

•

Bayesian Gaussian mixture model

•

Learning from demonstrations

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

LIDIAP

Event name	Event place	Event date
IEEE/RSJ Intl Conf. on Intelligent Robots and Systems	Las Vegas, NV, USA	24 Oct.-24 Jan. 2021

Available on Infoscience

April 13, 2021

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/177310