Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Israilov, Sardor; Fu, Li; Sanchez-Rodriguez, Jesus; Fusco, Franco; Allibert, Guillaume; Raufaste, Christophe; Argentina, Mederic

doi:10.1371/journal.pone.0280071

Israilov, Sardor; Fu, Li; Sanchez-Rodriguez, Jesus; Fusco, Franco; Allibert, Guillaume; Raufaste, Christophe; Argentina, Mederic

2023

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach.

Details

Title Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Author(s) Israilov, Sardor ; Fu, Li ; Sanchez-Rodriguez, Jesus ; Fusco, Franco ; Allibert, Guillaume ; Raufaste, Christophe ; Argentina, Mederic

Published in Plos One

Volume 18

Issue 2

Pages e0280071

Date 2023-02-13

Publisher San Francisco, PUBLIC LIBRARY SCIENCE

ISSN 1932-6203

DOI https://doi.org/10.1371/journal.pone.0280071

Other identifier(s) View record in Web of Science

Laboratories LFMI

Record Appears in Scientific production and competences > STI - School of Engineering > IGM - Institute of Mechanical Engineering > LFMI - Laboratory of Fluid Mechanics and Instabilities
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2023-04-24