Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Controlled Training Data Generation with Diffusion Models
 
research article

Controlled Training Data Generation with Diffusion Models

Yeo, Shuqing Teresa  
•
Atanov, Andrei  
•
Benoit, Harold
Show more
2025
Transactions on Machine Learning Research

We present a method to control a text-to-image generative model to produce training data useful for supervised learning. Unlike previous works that employ an open-loop approach via pre-defined prompts to generate new data using either a language model or human expertise, we develop an automated closed-loop system that involves two feedback mechanisms. The first mechanism uses feedback from a given supervised model to find adversarial prompts that result in generated images that maximize the model’s loss and, consequently, expose its vulnerabilities. While these adversarial prompts generate training examples curated for improving the given model, they are not curated for a specific target distribution of interest, which can be inefficient. Therefore, we introduce the second feedback mechanism that can optionally guide the generation process towards a desirable target distribution. We call the method combining these two mechanisms Guided Adversarial Prompts. The proposed closed-loop system allows us to control the training data generation for a given model and target image distribution (see Fig. 1 (Right)). We evaluate on different tasks, datasets, and architectures, with different types of distribution shifts (corruptions, spurious correlations, unseen domains) and illustrate the advantages of the proposed feedback mechanisms compared to open-loop approaches.

  • Details
  • Metrics
Type
research article
Scopus ID

2-s2.0-105003283133

Author(s)
Yeo, Shuqing Teresa  
Atanov, Andrei  

EPFL

Benoit, Harold
Alekseev, Aleksandr  

EPFL

Ray, Ruchira
Akhoondi, Pooya Esmaeil
Corporate authors
Zamir, Amir  
Date Issued

2025

Published in
Transactions on Machine Learning Research
Volume

3(2025)

Start page

1341

End page

1345

URL

Online paper

https://openreview.net/pdf?id=sSOxuUjE2o
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
VILAB  
Available on Infoscience
May 5, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/249736
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés