A Simple Learning Strategy for High-Speed Quadrocopter Multi-Flips

We describe a simple and intuitive policy gradient method for improving parametrized quadrocopter multi-flips by combining iterative experiments with information from a first-principles model. We start by formulating an N-flip maneuver as a five-step primitive with five adjustable parameters. Optimization using a low-order first-principles 2D vertical plane model of the quadrocopter yields an initial set of parameters and a corrective matrix. The maneuver is then repeatedly performed with the vehicle. At each iteration the state error at the end of the primitive is used to update the maneuver parameters via a gradient adjustment. The method is demonstrated at the ETH Zurich Flying Machine Arena testbed on quadrotor helicopters performing and improving on flips, double flips and triple flips.

Published in:
2010 Ieee International Conference On Robotics And Automation (Icra), 1642-1648
Presented at:
IEEE International Conference on Robotics and Automation (ICRA), Anchorage, AK, May 03-08, 2010
Ieee Service Center, 445 Hoes Lane, Po Box 1331, Piscataway, Nj 08855-1331 Usa

 Record created 2012-08-23, last modified 2018-09-13

Rate this document:

Rate this document:
(Not yet reviewed)