PLSA-based Image Auto-Annotation: Constraining the Latent Space

Gatica-Perez, Daniel

doi:10.1145/1027527.1027608

Monay, Florent

•

Gatica-Perez, Daniel

2004

MULTIMEDIA '04: Proceedings of the 12th annual ACM international conference on Multimedia

ACM Int. Conf. on Multimedia (ACM MM)

PLSA-based Image Auto-Annotation: Constraining the Latent Space

conference paper

We address the problem of unsupervised image auto-annotation with probabilistic latent space models. Unlike most previous works, which build latent space representations assuming equal relevance for the text and visual modalities, we propose a new way of modeling multi-modal co-occurrences, constraining the definition of the latent space to ensure its consistency in semantic terms (words), while retaining the ability to jointly model visual information. The concept is implemented by a linked pair of Probabilistic Latent Semantic Analysis (PLSA) models. On a 16000-image collection, we show with extensive experiments and using various performance measures, that our approach significantly outperforms previous joint models.

Name

monay-acm-1568937089.pdf

Access type

openaccess

Size

522.56 KB

Format

Adobe PDF

Checksum (MD5)

1aefaf319fa5d109277d44823faa07f6