Compact Q-Learning Optimized for Micro-robots with Processing and Memory Constraints
Scaling down robots to miniature size introduces many new challenges including memory and program size limitations, low processor performance and low power autonomy. In this paper we describe the concept and implementation of learning of a safewandering task with the autonomous micro-robots, Alice. We propose a simplified reinforcement learning algorithm based on one-step Qlearning that is optimized in speed and memory consumption. This algorithm uses only integer-based sum operators and avoids floatingpoint and multiplication operators. Finally, quality of learning is compared to a floating-point based algorithm.
RAS-sent for print.pdf
openaccess
1.44 MB
Adobe PDF
7499551f685e04de39f343e65a3d790e