Abstract

Generalizing manipulation skills to new situations requires extracting invariant patterns from demonstrations. For example, the robot needs to understand the demonstrations at a higher level of abstraction while being invariant to the appearance of the objects, their geometric properties such as position, size, and orientation, and the viewpoint of the observer. In this paper, we propose an algorithm that learns a joint probability density function of the demonstrations with invariant formulations of hidden semi-Markov models to extract invariant segments (also termed sub-goals or options), and smoothly follows the generated sequence of states with a linear quadratic tracking controller. The algorithm takes as input the demonstrations with respect to different coordinate systems describing virtual landmarks or objects of interest in a task-parameterized formulation, and adapts the segments to environmental changes in a systematic manner. We present variants of this algorithm in latent space with low-rank covariance decompositions, semi-tied covariances, and non-parametric online estimation of model parameters under small-variance asymptotics, yielding considerably lower sample and model complexity for acquiring new manipulation skills. The algorithm allows a Baxter robot to learn a pick-and-place task while avoiding a movable obstacle from only four kinesthetic demonstrations.
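
To make the last stage of this pipeline concrete, the sketch below tracks a step-wise reference built from a sequence of sub-goal states with a finite-horizon linear quadratic tracking controller. The double-integrator dynamics, sub-goal means, precision matrices, and state sequence are hypothetical placeholders standing in for quantities that the paper obtains from the learned task-parameterized hidden semi-Markov model; this is a minimal illustration under those assumptions, not the authors' implementation.

    # Illustrative sketch: finite-horizon linear quadratic tracking (LQT) of a
    # step-wise sub-goal reference. All numbers below are hypothetical; in the
    # paper the sub-goals and their precisions come from the learned HSMM.
    import numpy as np

    def lqt_gains(A, B, Q_seq, R, ref):
        """Backward Riccati recursion for finite-horizon LQT.
        Returns feedback gains K[t] and feedforward terms k[t] such that
        u_t = -K[t] @ x_t + k[t] tracks the reference `ref` (T x n)."""
        T, n = ref.shape
        m = B.shape[1]
        P = Q_seq[-1].copy()          # terminal value function, quadratic part
        v = Q_seq[-1] @ ref[-1]       # terminal value function, linear part
        K = np.zeros((T, m, n))
        k = np.zeros((T, m))
        for t in range(T - 2, -1, -1):
            G = np.linalg.inv(R + B.T @ P @ B)
            K[t] = G @ B.T @ P @ A
            k[t] = G @ B.T @ v
            P = Q_seq[t] + A.T @ P @ (A - B @ K[t])
            v = Q_seq[t] @ ref[t] + (A - B @ K[t]).T @ v
        return K, k

    # Hypothetical setup: 2-D point mass (position + velocity), dt = 0.01 s.
    dt, n_pos = 0.01, 2
    A = np.block([[np.eye(n_pos), dt * np.eye(n_pos)],
                  [np.zeros((n_pos, n_pos)), np.eye(n_pos)]])
    B = np.vstack([0.5 * dt**2 * np.eye(n_pos), dt * np.eye(n_pos)])
    R = 1e-2 * np.eye(n_pos)

    # Step-wise reference from an assumed most-likely state sequence:
    # two sub-goals (e.g. "pick" then "place"), each held for 100 steps.
    subgoal_means = np.array([[0.3, 0.2, 0.0, 0.0],
                              [0.6, 0.5, 0.0, 0.0]])
    subgoal_prec = [np.diag([100.0, 100.0, 1.0, 1.0])] * 2  # inverse covariances
    state_seq = [0] * 100 + [1] * 100
    ref = subgoal_means[state_seq]
    Q_seq = np.stack([subgoal_prec[s] for s in state_seq])  # time-varying weights

    K, k = lqt_gains(A, B, Q_seq, R, ref)

    # Roll out the controller from an arbitrary start state.
    x = np.zeros(4)
    for t in range(len(state_seq) - 1):
        u = -K[t] @ x + k[t]
        x = A @ x + B @ u
    print("final position:", x[:2])  # should settle near the last sub-goal

The backward recursion yields time-varying feedback gains and feedforward terms, so the controller trades off reaching each sub-goal (weighted by its precision matrix) against control effort, producing a smooth motion through the generated sequence of states.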
