Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Training Efficient Controllers via Analytic Policy Gradient
 
conference paper

Training Efficient Controllers via Analytic Policy Gradient

Wiedemann, Nina
•
Wüest, Valentin  
•
Loquercio, Antonio
Show more
2023
2023 Ieee International Conference On Robotics And Automation, Icra
2023 IEEE International Conference on Robotics and Automation (ICRA) "Embracing the Future: Making Robots for Humans"

Control design for robotic systems is complex and often requires solving an optimization to follow a trajectory accurately. Online optimization approaches like Model Predictive Control (MPC) have been shown to achieve great tracking performance, but require high computing power. Conversely, learning-based offline optimization approaches, such as Reinforcement Learning (RL), allow fast and efficient execution on the robot but hardly match the accuracy of MPC in trajectory tracking tasks. In systems with limited compute, such as aerial vehicles, an accurate controller that is efficient at execution time is imperative. We propose an Analytic Policy Gradient (APG) method to tackle this problem. APG exploits the availability of differentiable simulators by training a controller offline with gradient descent on the tracking error. We address training instabilities that frequently occur with APG through curriculum learning and experiment on a widely used controls benchmark, the CartPole, and two common aerial robots, a quadrotor and a fixed-wing drone. Our proposed method outperforms both model-based and model-free RL methods in terms of tracking error. Concurrently, it achieves similar performance to MPC while requiring more than an order of magnitude less computation time. Our work provides insights into the potential of APG as a promising control method for robotics.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

arXiv_Wiedemann_Wueest.pdf

Type

Preprint

Version

http://purl.org/coar/version/c_71e4c1898caa6e32

Access type

openaccess

License Condition

n/a

Size

519.5 KB

Format

Adobe PDF

Checksum (MD5)

f9aad8c87e120d5b0d9cf8f5d8bfa665

Loading...
Thumbnail Image
Name

analytic_policy_gradient_graphical_abstract.jpg

Type

Thumbnail

Access type

openaccess

License Condition

n/a

Size

1.4 MB

Format

JPEG

Checksum (MD5)

ceabb0451c6293e754e3fd3ebdcd1108

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés