Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task

Mohammadshahi, Alireza; Lebret, Rémi Philippe; Aberer, Karl

doi:10.18653/v1/D19-6605

2019

Abstract

In this paper, we propose a new approach to learning multimodal multilingual embeddings for matching images with their relevant captions in two languages. We combine two existing objective functions to bring images and captions close together in a joint embedding space while adapting the alignment of word embeddings between the languages in our model. We show that our approach enables better generalization, achieving state-of-the-art performance on the text-to-image and image-to-text retrieval tasks and on the caption-caption similarity task. Two multimodal multilingual datasets are used for evaluation: Multi30k with German and English captions and Microsoft-COCO with English and Japanese captions.
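
As a rough illustration of what retrieval in a joint embedding space means, the sketch below ranks images for a caption by cosine similarity and scores the ranking with Recall@K. It is not the authors' model: the abstract does not name the two objective functions or the evaluation metric, so the use of cosine similarity, the Recall@K metric, the function names, and the toy two-dimensional vectors are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_images(caption_vec, image_vecs):
    """Return image indices ordered from most to least similar to the caption."""
    scores = [cosine(caption_vec, img) for img in image_vecs]
    return sorted(range(len(image_vecs)), key=lambda i: scores[i], reverse=True)


def recall_at_k(caption_vecs, image_vecs, k):
    """Fraction of captions whose paired image (same index) is ranked in the top k."""
    hits = sum(
        1
        for i, cap in enumerate(caption_vecs)
        if i in rank_images(cap, image_vecs)[:k]
    )
    return hits / len(caption_vecs)


# Made-up embeddings (assumption): caption i in each language describes image i.
image_vecs = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
english_caps = [[0.9, 0.1], [0.1, 0.9], [0.6, 0.8]]
german_caps = [[0.8, 0.3], [0.2, 1.0], [1.0, 0.2]]

print(recall_at_k(english_caps, image_vecs, k=1))
print(recall_at_k(german_caps, image_vecs, k=1))

# Caption-caption similarity across languages uses the same space.
print(cosine(english_caps[0], german_caps[0]))
```

In this toy setup the third German caption sits closer to the first image than to its own, so it only counts as a hit once k is at least 2. Under the assumptions above, that is the kind of cross-lingual misalignment a better-aligned embedding space would reduce.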

Details

Title Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task

Author(s) Mohammadshahi, Alireza ; Lebret, Rémi Philippe ; Aberer, Karl

Published in Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER)

Pagination 7

Pages 27-33

Conference 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Hong Kong, China, November 3-7, 2019

Date 2019-11-03

Publisher Hong Kong, Association for Computational Linguistics

Keywords

NLP; Deep Learning; Image; caption; retrieval

Note This article is licensed under a Creative Commons Attribution 4.0 International License

DOI https://doi.org/10.18653/v1/D19-6605

Laboratories LSIR
LIDIAP

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LSIR - Distributed Information Systems Laboratory
Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Conference Papers
Work produced at EPFL

Record creation date 2019-12-12
