Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning

Gado, Elena Grazia; Martorella, Tommaso; Zunino, Luca; Mejia-Domenzain, Paola; Swamy, Vinitra; Frej, Jibril; Käser, Tanja

doi:10.48550/arxiv.2405.20079

conference paper not in proceedings

2024

17th International Conference on Educational Data Mining (EDM 2024)

Intelligent Tutoring Systems (ITS) enhance personalized learning by predicting student answers to provide immediate and customized instruction. However, recent research has primarily focused on the correctness of the answer rather than the student's performance on specific answer choices, limiting insights into students' thought processes and potential misconceptions. To address this gap, we present MCQStudentBert, an answer forecasting model that leverages the capabilities of Large Language Models (LLMs) to integrate contextual understanding of students' answering history along with the text of the questions and answers. Because the model predicts the specific answer choices students are likely to make, practitioners can easily extend it to new answer choices or remove answer choices for the same multiple-choice question (MCQ) without retraining. In particular, we compare MLP, LSTM, BERT, and Mistral 7B architectures to generate embeddings from students' past interactions, which are then incorporated into a fine-tuned BERT's answer-forecasting mechanism. We apply our pipeline to a dataset of language learning MCQs, gathered from an ITS with over 10,000 students, to explore the predictive accuracy of MCQStudentBert, which incorporates student interaction patterns, in comparison to correct answer prediction and traditional mastery-learning feature-based approaches. This work opens the door to more personalized content, modularization, and granular support.
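The abstract's claim that answer choices can be added or removed without retraining follows if each (question, answer choice) pair is scored independently, conditioned on a student representation. The sketch below illustrates only that property and is not the authors' implementation: `toy_embed` is a hypothetical hash-based stand-in for the learned encoders (it is not BERT or Mistral 7B), and `DIM`, `choice_score`, `forecast`, and the example question are assumptions made for illustration.

```python
import hashlib
import math

DIM = 16  # hypothetical embedding size, chosen only for this toy example


def toy_embed(text, dim=DIM):
    """Deterministic stand-in for a text encoder (NOT a real language model).

    Maps the first `dim` bytes of a SHA-256 digest to floats in [-1, 1].
    """
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [digest[i] / 127.5 - 1.0 for i in range(dim)]


def choice_score(student_vec, question, choice):
    """Score one (question, choice) pair for one student.

    The score depends only on this pair and the student vector,
    never on the other answer choices offered.
    """
    pair_vec = toy_embed(question + " [SEP] " + choice)
    logit = sum(s * p for s, p in zip(student_vec, pair_vec))
    return 1.0 / (1.0 + math.exp(-logit))  # sigmoid, value in (0, 1)


def forecast(student_vec, question, choices):
    """Return a per-choice score dictionary for the given answer choices."""
    return {c: choice_score(student_vec, question, c) for c in choices}


# Hypothetical student history and language-learning question.
student = toy_embed("q1:B q2:A q3:C")
question = "Ich ___ müde."
base = forecast(student, question, ["bin", "bist", "ist"])
extended = forecast(student, question, ["bin", "bist", "ist", "sind"])
```

Because every choice is scored on its own, the scores in `base` are unchanged in `extended` when a fourth choice is added. A softmax over all choices would not have this property, since each probability would then depend on the full set of options.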

Name: 2405.20079v1.pdf
Type: Preprint
Version: Submitted version (Preprint)
Access type: openaccess
License Condition: CC BY
Size: 1.93 MB
Format: Adobe PDF
Checksum (MD5): 060e9dc3fdd26cb1da8638404e90f400