Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Leveraging Knowledge Profiles and Generative AI for Realistic Student Response Generation
 
conference paper

Leveraging Knowledge Profiles and Generative AI for Realistic Student Response Generation

Ipçi, Eylül  
•
Nazaretsky, Tatjana  
•
Käser, Tanja  
July 2025
Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium, Blue Sky, and WideAIED
26th International Conference on Artificial Intelligence in Education

The scarcity of high-quality labeled data often hinders the effective use of automated formative feedback in education. While analytic rubrics offer a reliable framework for automated grading, training robust models still requires hundreds of expert-labeled responses, an expensive and time-consuming process. This paper proposes a methodology for generating diverse, rubric-aligned synthetic student responses using large language models (LLMs) guided by knowledge profiles and representative examples. We introduce two profile-based generation strategies, straightforward and error-informed, and evaluate them compared to a dataset (N = 585) of authentic open-ended logical-proof responses from a Discrete Mathematics course. We analyze the diversity and realism of the generated datasets using embedding-based distance metrics and PCA and assess their utility for training automated grading models. Our results show that synthetic responses are less diverse than authentic ones, and models trained solely on generated data perform worse than those trained on real data. However, combining small authentic datasets with generated data significantly improves model performance, suggesting as an effective augmentation strategy in low-resource educational settings.

  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-031-99264-3_18
Author(s)
Ipçi, Eylül  

EPFL

Nazaretsky, Tatjana  

EPFL

Käser, Tanja  

EPFL

Date Issued

2025-07

Publisher

Springer Nature

Publisher place

Switzerland

Published in
Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium, Blue Sky, and WideAIED
DOI of the book
https://doi.org/10.1007/978-3-031-99264-3
ISBN of the book

978-3-031-99264-3

Total of pages

414

Series title/Series vol.

Communications in Computer and Information Science; 2591

Start page

143

End page

151

Subjects

Generative AI

•

Data Augmentation

•

Higher Education

•

Formative Feedback

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
ML4ED  
LEARN
Event nameEvent acronymEvent placeEvent date
26th International Conference on Artificial Intelligence in Education

AIED 2025

Palermo, Italy

2025-07-22 - 2025-07-26

Available on Infoscience
August 21, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/253283
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés