Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. A generative AI approach to cost-effective advertisement based on synthetic images
 
conference paper

A generative AI approach to cost-effective advertisement based on synthetic images

Demirtas, Enes Eray  
•
Lu, Yuhang  
•
Delarive, Leila
Show more
Tescher, Andrew G.
•
Ebrahimi, Touradj
September 16, 2025
Applications of Digital Image Processing XLVIII
Applications of Digital Image Processing XLVIII

Advertisement heavily relies on compelling visuals to engage audiences across sectors. Recent advances in AIdriven text-to-image generation, particularly diffusion models like Stable Diffusion, offer novel opportunities for hyper-personalized and context-aware advertising content. However, challenges remain in precise control over image composition, segmentation robustness, and semantic consistency. In this work, we enhance the state-of-the-art Anywhere-Multi-Agent framework by replacing the original RMBG segmentation module with the Segment Anything Model (SAM), integrated via an interactive web interface enabling user-guided mask refinement. We further improve generation fidelity through prompt engineering with language models and explore multiple ControlNet conditioning strategies, including Canny, depth, and their combination modalities. Our experiments demonstrate significant gains in segmentation accuracy, object placement, and background coherence, facilitating flexible and precise image composition suitable for real-world advertising workflows. These modular improvements pave the way for scalable, controllable generative pipelines that better align AI outputs with user intent.

  • Details
  • Metrics
Type
conference paper
DOI
10.1117/12.3068096
Author(s)
Demirtas, Enes Eray  

École Polytechnique Fédérale de Lausanne

Lu, Yuhang  

École Polytechnique Fédérale de Lausanne

Delarive, Leila
Ebrahimi, Touradj  

École Polytechnique Fédérale de Lausanne

Editors
Tescher, Andrew G.
•
Ebrahimi, Touradj
Date Issued

2025-09-16

Publisher

SPIE

Published in
Applications of Digital Image Processing XLVIII
Series title/Series vol.

Proceedings; 13605

Start page

54

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
GR-EB  
Event nameEvent acronymEvent placeEvent date
Applications of Digital Image Processing XLVIII

San Diego, United States

2025-08-03 - 2025-08-08

Available on Infoscience
September 30, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/254457
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés