A Comparison of PSO and Reinforcement Learning for Multi-Robot Obstacle Avoidance

Di Mario, Ezequiel; Talebpour, Zeynab; Martinoli, Alcherio

doi:10.1109/CEC.2013.6557565

2013

Abstract

The design of high-performing robotic controllers constitutes an example of expensive optimization in uncertain environments due to the often large parameter space and noisy performance metrics. There are several evaluative techniques that can be employed for on-line controller design. Adequate benchmarks help in the choice of the right algorithm in terms of final performance and evaluation time. In this paper, we use multi-robot obstacle avoidance as a benchmark to compare two different evaluative learning techniques: Particle Swarm Optimization and Q-learning. For Q-learning, we implement two different approaches: one with discrete states and discrete actions, and another one with discrete actions but a continuous state space. We show that continuous PSO has the highest fitness overall, and Q-learning with continuous states performs significantly better than Q-learning with discrete states. We also show that in the single robot case, PSO and Q-learning with discrete states require a similar amount of total learning time to converge, while the time required with Q-learning with continuous states is significantly larger. In the multi-robot case, both Q-learning approaches require a similar amount of time as in the single robot case, but the time required by PSO can be significantly reduced due to the distributed nature of the algorithm.
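For readers unfamiliar with the two techniques named in the abstract, the update rules can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it shows textbook tabular Q-learning (discrete states and actions) and a textbook global-best PSO step. The function names, the parameter values (alpha, gamma, w, c1, c2), and the use of a global best instead of any neighborhood scheme are all assumptions made for the example; the record does not specify them.

```python
import random


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning update (discrete states and actions).

    q is a list of per-state lists of action values. alpha (learning
    rate) and gamma (discount factor) are illustrative values only.
    """
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
    return q[state][action]


def pso_step(positions, velocities, personal_best, global_best, rng,
             w=0.7, c1=1.5, c2=1.5):
    """One in-place PSO velocity and position update for all particles.

    Textbook global-best variant; w, c1 and c2 are illustrative values,
    not those used in the paper.
    """
    for i, (x, v) in enumerate(zip(positions, velocities)):
        for d in range(len(x)):
            r1, r2 = rng.random(), rng.random()
            v[d] = (w * v[d]
                    + c1 * r1 * (personal_best[i][d] - x[d])
                    + c2 * r2 * (global_best[d] - x[d]))
            x[d] += v[d]


# Hypothetical usage: a 2-state, 2-action table and a single 1-D particle.
q_table = [[0.0, 0.0], [1.0, 2.0]]
q_update(q_table, state=0, action=1, reward=1.0, next_state=1)

particle_pos = [[0.0]]
particle_vel = [[0.0]]
pso_step(particle_pos, particle_vel, [[1.0]], [1.0], random.Random(0))
```

In a controller-learning setting, each PSO particle would hold a candidate parameter vector whose fitness comes from a robot evaluation, which is what makes the evaluations easy to spread across several robots; the Q-learning table, by contrast, is updated step by step from a single robot's experience.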

Details

Title A Comparison of PSO and Reinforcement Learning for Multi-Robot Obstacle Avoidance

Author(s) Di Mario, Ezequiel ; Talebpour, Zeynab ; Martinoli, Alcherio

Published in 2013 IEEE Congress on Evolutionary Computation (CEC)

Pagination 8

Pages 149-156

Conference IEEE Congress on Evolutionary Computation, Cancún, México, June 20-23, 2013

Date 2013

Publisher New York, IEEE

ISBN 978-1-4799-0454-9

Keywords

Obstacle Avoidance; Q-Learning; Reinforcement Learning; Particle Swarm Optimization; Robotics

DOI https://doi.org/10.1109/CEC.2013.6557565

Laboratories NCCR-ROBOTICS
DISAL

Record appears in
Scientific production and competences > ENAC - School of Architecture, Civil and Environmental Engineering > IIE - Environmental Engineering Institute > DISAL - Distributed Intelligent Systems and Algorithms Laboratory
Scientific production and competences > EPFL Partners > NCCR-ROBOTICS - National Centre of Competence in Research (NCCR) Robotics
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2013-05-20
