Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Training Efficient Controllers via Analytic Policy Gradient
 
conference paper

Training Efficient Controllers via Analytic Policy Gradient

Wiedemann, Nina
•
Wüest, Valentin  
•
Loquercio, Antonio
Show more
2023
2023 Ieee International Conference On Robotics And Automation, Icra
2023 IEEE International Conference on Robotics and Automation (ICRA) "Embracing the Future: Making Robots for Humans"

Control design for robotic systems is complex and often requires solving an optimization to follow a trajectory accurately. Online optimization approaches like Model Predictive Control (MPC) have been shown to achieve great tracking performance, but require high computing power. Conversely, learning-based offline optimization approaches, such as Reinforcement Learning (RL), allow fast and efficient execution on the robot but hardly match the accuracy of MPC in trajectory tracking tasks. In systems with limited compute, such as aerial vehicles, an accurate controller that is efficient at execution time is imperative. We propose an Analytic Policy Gradient (APG) method to tackle this problem. APG exploits the availability of differentiable simulators by training a controller offline with gradient descent on the tracking error. We address training instabilities that frequently occur with APG through curriculum learning and experiment on a widely used controls benchmark, the CartPole, and two common aerial robots, a quadrotor and a fixed-wing drone. Our proposed method outperforms both model-based and model-free RL methods in terms of tracking error. Concurrently, it achieves similar performance to MPC while requiring more than an order of magnitude less computation time. Our work provides insights into the potential of APG as a promising control method for robotics.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICRA48891.2023.10160581
Web of Science ID

WOS:001036713001025

Author(s)
Wiedemann, Nina
Wüest, Valentin  
Loquercio, Antonio
Müller, Matthias
Floreano, Dario  
Scaramuzza, Davide  
Date Issued

2023

Publisher

New York

Publisher place

IEEE

Published in
2023 Ieee International Conference On Robotics And Automation, Icra
ISBN of the book

979-8-3503-2365-8

Start page

1349

End page

1356

Subjects

Aerial Robotics

•

Trajectory Tracking

•

Policy Learning

•

Model Predictive Control

•

Reinforcement Learning

URL

arXiv Link

https://arxiv.org/abs/2209.13052

Open Source Code

https://github.com/lis-epfl/apg_trajectory_tracking
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIS  
Event nameEvent placeEvent date
2023 IEEE International Conference on Robotics and Automation (ICRA) "Embracing the Future: Making Robots for Humans"

London

May 29-June 2, 2023

FunderGrant Number

FNS-NCCR

565520

Available on Infoscience
June 12, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/198218
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés