Image-Guided Topic Modeling for Interpretable Privacy Classification

Baia, Alina Elena; Cavallaro, Andrea

doi:10.1007/978-3-031-92648-8_13

conference paper

Image-Guided Topic Modeling for Interpretable Privacy Classification

Baia, Alina Elena

•

Cavallaro, Andrea

Del Bue, Alessio

•

Canton, Cristian

2025

Computer Vision – ECCV 2024 Workshops, Proceedings

European Conference on Computer Vision

Predicting and explaining the private information contained in an image in human-understandable terms is a complex and contextual task. This task is challenging even for large language models. To facilitate the understanding of privacy decisions, we propose to predict image privacy based on a set of natural language content descriptors. These content descriptors are associated with privacy scores that reflect how people perceive image content. We generate descriptors with our novel Image-guided Topic Modeling (ITM) approach. ITM leverages, via multimodality alignment, both vision information and image textual descriptions from a vision language model. We use the ITM-generated descriptors to learn a privacy predictor, Priv×ITM, whose decisions are interpretable by design. Our Priv×ITM classifier outperforms the reference interpretable method by 5% points in accuracy and performs comparably to the current non-interpretable state-of-the-art model.

Type

conference paper

DOI

10.1007/978-3-031-92648-8_13

Author(s)

Baia, Alina Elena

Institut Dalle Molle D'intelligence Artificielle Perceptive

Cavallaro, Andrea

EPFL

Editors

Del Bue, Alessio

•

Canton, Cristian

•

Pont-Tuset, Jordi

•

Tommasi, Tatiana

Date Issued

2025

Publisher

Springer Science and Business Media Deutschland GmbH

Published in

Computer Vision – ECCV 2024 Workshops, Proceedings

Series title/Series vol.

Lecture Notes in Computer Science; 15643 LNCS

ISSN (of the series)

1611-3349

0302-9743

Start page

200

End page

217

Subjects

Interpretability

•

Topic modeling

•

Vision language models

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

LIDIAP

Event name	Event acronym	Event place	Event date
European Conference on Computer Vision	ECCV 2024	Milan, Italy	2024-09-29 - 2024-10-04

Funder	Funding(s)	Grant Number	Grant URL
MUR PNRR project FAIR - Future AI Research		PE00000013
EU project AI4TRUST		101070190

Available on Infoscience

June 26, 2025

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/251631