Multi-Armed Bandits for Addressing the Exploration/Exploitation Trade-off in Self Improving Learning Environment

Faucon, Louis Pierre

semester or other student projects

2017

This project proposes the use of machine learning techniques such as Multi-Armed Bandits to implement self-improving learning environments. The goal of a self-improving learning environment is to perform good pedagogical choices while measuring the efficiency of these choices. The modeling of students is done using the LFA model and fitted on a dataset of university courses to allow to simulate students. Three experiments with simulated students are carried out and show that the Multi-Armed Bandit approach improves learning outcomes.

Type

semester or other student projects

Author(s)

Faucon, Louis Pierre

Advisors

Dillenbourg, Pierre

Date Issued

2017

Subjects

Multi-Armed Bandit

•

Self-Improving Learning Environment

•

Education

•

chililearninganalytics

Written at

EPFL

EPFL units

CHILI

Available on Infoscience

August 9, 2017

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/139602