A Large-Scale Database of Images and Captions for Automatic Face Naming

Ozcan, Mert; Luo, Jie; Ferrari, Vittorio; Caputo, Barbara

2011

Abstract

We present a large-scale database of images and captions, designed to support research on using captioned images from the Web to train visual classifiers. It consists of more than 125,000 images of celebrities from different fields, downloaded from the Web. Each image is associated with its original text caption, extracted from the HTML page the image comes from. We call it FAN-Large, for Face And Names Large scale database. Its size and deliberately high level of noise make it, to our knowledge, the largest and most realistic database supporting this type of research. The dataset and its annotations are publicly available and can be obtained from http://www.vision.ee.ethz.ch/~calvin/fanlarge/. We report a thorough assessment of FAN-Large using several existing approaches for name-face association, and we present and evaluate new contextual features derived from the caption. Our findings provide important cues about the strengths and limitations of existing approaches.

Details

Title A Large-Scale Database of Images and Captions for Automatic Face Naming

Author(s) Ozcan, Mert ; Luo, Jie ; Ferrari, Vittorio ; Caputo, Barbara

Date 2011

Publisher Idiap

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports

Record creation date 2013-12-19
