Infoscience

conference paper

Image-Guided Topic Modeling for Interpretable Privacy Classification

Baia, Alina Elena
•
Cavallaro, Andrea  
2025
Computer Vision – ECCV 2024 Workshops, Proceedings
European Conference on Computer Vision

Predicting and explaining, in human-understandable terms, the private information contained in an image is a complex and contextual task that remains challenging even for large language models. To facilitate the understanding of privacy decisions, we propose to predict image privacy from a set of natural language content descriptors. These descriptors are associated with privacy scores that reflect how people perceive image content. We generate the descriptors with our novel Image-guided Topic Modeling (ITM) approach, which leverages, via multimodal alignment, both visual information and textual image descriptions from a vision language model. We use the ITM-generated descriptors to learn a privacy predictor, Priv×ITM, whose decisions are interpretable by design. Our Priv×ITM classifier outperforms the reference interpretable method by 5 percentage points in accuracy and performs comparably to the current non-interpretable state-of-the-art model.
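To make the idea of a predictor that is "interpretable by design" concrete, here is a minimal sketch, not the authors' Priv×ITM implementation. It assumes a simple linear model over image-descriptor similarity scores. The descriptor strings, the weights, the bias, and the similarity values are all hypothetical and chosen only for illustration. In practice the similarities would come from a vision language model and the weights would be learned from data.

```python
import math

# Hypothetical content descriptors with illustrative privacy weights.
# Positive weights push toward "private", negative toward "public".
# None of these values come from the paper.
DESCRIPTOR_WEIGHTS = {
    "a person's face in close-up": 1.5,
    "an identity document": 2.0,
    "a landscape without people": -1.8,
    "food on a table": -1.0,
}
BIAS = 0.0  # hypothetical intercept


def predict_privacy(similarities, weights=DESCRIPTOR_WEIGHTS, bias=BIAS):
    """Return the probability of 'private' and per-descriptor contributions.

    `similarities` maps each descriptor to an image-descriptor similarity
    score (assumed here to lie in [0, 1]).
    """
    # Each descriptor contributes weight * similarity to the logit,
    # so every term of the decision can be read off directly.
    contributions = {d: w * similarities.get(d, 0.0) for d, w in weights.items()}
    logit = bias + sum(contributions.values())
    prob = 1.0 / (1.0 + math.exp(-logit))
    # Rank descriptors by how strongly they influenced the decision.
    ranked = sorted(contributions.items(), key=lambda kv: abs(kv[1]), reverse=True)
    return prob, ranked


# Hypothetical similarity scores for one image.
example = {
    "a person's face in close-up": 0.9,
    "an identity document": 0.1,
    "a landscape without people": 0.05,
    "food on a table": 0.0,
}
prob, ranked = predict_privacy(example)
label = "private" if prob >= 0.5 else "public"
print(label, round(prob, 3))
for descriptor, contribution in ranked:
    print(f"{contribution:+.2f}  {descriptor}")
```

Because the logit is a plain sum of per-descriptor terms, the ranked list doubles as the explanation: it shows which natural language descriptors drove the prediction and in which direction.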

Type
conference paper
DOI
10.1007/978-3-031-92648-8_13
Author(s)
Baia, Alina Elena

Institut Dalle Molle D'intelligence Artificielle Perceptive

Cavallaro, Andrea  

EPFL

Editors
Del Bue, Alessio
•
Canton, Cristian
•
Pont-Tuset, Jordi
•
Tommasi, Tatiana
Date Issued

2025

Publisher

Springer Science and Business Media Deutschland GmbH

Published in
Computer Vision – ECCV 2024 Workshops, Proceedings
Series title/Series vol.

Lecture Notes in Computer Science; 15643 LNCS

ISSN (of the series)

1611-3349

0302-9743

Start page

200

End page

217

Subjects

Interpretability

•

Topic modeling

•

Vision language models

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event name
European Conference on Computer Vision
Event acronym
ECCV 2024
Event place
Milan, Italy
Event date
2024-09-29 - 2024-10-04

Funding
MUR PNRR project FAIR - Future AI Research, grant number PE00000013
EU project AI4TRUST, grant number 101070190

Available on Infoscience
June 26, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/251631

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved