Training Efficient Controllers via Analytic Policy Gradient

Wiedemann, Nina; Wüest, Valentin; Loquercio, Antonio; Müller, Matthias; Floreano, Dario; Scaramuzza, Davide

doi:10.1109/ICRA48891.2023.10160581

conference paper

Training Efficient Controllers via Analytic Policy Gradient

Wiedemann, Nina

•

Wüest, Valentin

•

Loquercio, Antonio

2023

2023 Ieee International Conference On Robotics And Automation, Icra

2023 IEEE International Conference on Robotics and Automation (ICRA) "Embracing the Future: Making Robots for Humans"

Control design for robotic systems is complex and often requires solving an optimization to follow a trajectory accurately. Online optimization approaches like Model Predictive Control (MPC) have been shown to achieve great tracking performance, but require high computing power. Conversely, learning-based offline optimization approaches, such as Reinforcement Learning (RL), allow fast and efficient execution on the robot but hardly match the accuracy of MPC in trajectory tracking tasks. In systems with limited compute, such as aerial vehicles, an accurate controller that is efficient at execution time is imperative. We propose an Analytic Policy Gradient (APG) method to tackle this problem. APG exploits the availability of differentiable simulators by training a controller offline with gradient descent on the tracking error. We address training instabilities that frequently occur with APG through curriculum learning and experiment on a widely used controls benchmark, the CartPole, and two common aerial robots, a quadrotor and a fixed-wing drone. Our proposed method outperforms both model-based and model-free RL methods in terms of tracking error. Concurrently, it achieves similar performance to MPC while requiring more than an order of magnitude less computation time. Our work provides insights into the potential of APG as a promising control method for robotics.

Type

conference paper

DOI

10.1109/ICRA48891.2023.10160581

Web of Science ID

WOS:001036713001025

Author(s)

Wiedemann, Nina

Wüest, Valentin

Loquercio, Antonio

Müller, Matthias

Floreano, Dario

Scaramuzza, Davide

Date Issued

2023

Publisher

New York

Publisher place

IEEE

Published in

2023 Ieee International Conference On Robotics And Automation, Icra

ISBN of the book

979-8-3503-2365-8

Start page

1349

End page

1356

Subjects

Aerial Robotics

•

Trajectory Tracking

•

Policy Learning

•

Model Predictive Control

•

Reinforcement Learning

URL

arXiv Link

https://arxiv.org/abs/2209.13052

Open Source Code

https://github.com/lis-epfl/apg_trajectory_tracking

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

LIS

Event name	Event place	Event date
2023 IEEE International Conference on Robotics and Automation (ICRA) "Embracing the Future: Making Robots for Humans"	London	May 29-June 2, 2023

Funder	Grant Number
FNS-NCCR	565520

Available on Infoscience

June 12, 2023

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/198218