semester or other student projects
Multi-Armed Bandits for Addressing the Exploration/Exploitation Trade-off in Self Improving Learning Environment
2017
This project proposes using machine learning techniques such as Multi-Armed Bandits to implement self-improving learning environments. The goal of a self-improving learning environment is to make good pedagogical choices while measuring the efficiency of these choices. Students are modeled with the LFA model, which is fitted on a dataset of university courses so that students can be simulated. Three experiments with simulated students are carried out and show that the Multi-Armed Bandit approach improves learning outcomes.
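To illustrate the exploration/exploitation trade-off named in the title, the following is a minimal sketch of one common bandit strategy, UCB1, applied to simulated Bernoulli rewards. It is not the project's implementation: the record does not say which bandit algorithm was used. The function name `ucb1`, the interpretation of each arm as a candidate pedagogical activity, and the success probabilities are all hypothetical. Students are not simulated with the LFA model here; a fixed success probability per arm stands in for that.

```python
import math
import random


def ucb1(success_probs, n_rounds, seed=0):
    """Run UCB1 on Bernoulli arms and return (pull counts, total reward).

    Assumption: each arm is a hypothetical pedagogical activity whose
    reward is 1 when a simulated student succeeds and 0 otherwise.
    """
    rng = random.Random(seed)
    n_arms = len(success_probs)
    counts = [0] * n_arms   # number of times each arm was chosen
    sums = [0.0] * n_arms   # cumulative reward collected per arm
    total = 0.0

    for t in range(1, n_rounds + 1):
        if t <= n_arms:
            # Initialisation: try every arm once.
            arm = t - 1
        else:
            # Choose the arm with the highest upper confidence bound:
            # empirical mean (exploitation) plus a bonus that shrinks
            # as the arm is pulled more often (exploration).
            arm = max(
                range(n_arms),
                key=lambda a: sums[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < success_probs[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total += reward

    return counts, total


if __name__ == "__main__":
    # Hypothetical success probabilities for three activities.
    counts, total = ucb1([0.1, 0.3, 0.9], n_rounds=2000)
    print("pulls per arm:", counts)
    print("total reward:", total)
```

In this sketch the bonus term makes rarely tried activities look temporarily attractive, so each one keeps being measured, while most choices concentrate on the activity with the best observed outcome.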
Type
semester or other student projects
Author(s)
Advisors
Date Issued
2017
Written at
EPFL
EPFL units
Available on Infoscience
August 9, 2017