Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation

Luo, Jie; Caputo, Barbara; Ferrari, Vittorio

Luo, Jie; Caputo, Barbara; Ferrari, Vittorio

2009

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Fichiers

Résumé

Given a corpus of news items consisting of images accompanied by text captions, we want to find out “who’s doing what”, i.e. associate names and action verbs in the captions to the face and body pose of the persons in the images. We present a joint model for simultaneously solving the image-caption correspondences and learning visual appearance models for the face and pose classes occurring in the corpus. These models can then be used to recognize people and actions in novel images without captions. We demonstrate experimentally that our joint ‘face and pose’ model solves the correspondence problem better than earlier models covering only the face, and that it can perform recognition of new uncaptioned images.

Détails

Titre Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation

Auteur(s) Luo, Jie ; Caputo, Barbara ; Ferrari, Vittorio

Publié dans Advances in Neural Information Processing Systems

Volume 22

Pages 1168-1176

Présenté à NIPS Foundation - Advances in Neural Information Processing Systems 22 (NIPS09), Vancouver, B.C., Canada

Date 2009

Editeur MIT Press

Lien supplémentaire URL

Laboratoires LIDIAP

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Production scientifique et compétences > Euler Center for Signal Processing
Papiers de conférence
Travail produit à l'EPFL
Publié

Date de création de la notice 2010-02-11

Actions

Aperçu

Sélectionner le fichier :