Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Language Transformers for Remote Sensing Visual Question Answering
 
conference paper

Language Transformers for Remote Sensing Visual Question Answering

Chappuis, Christel  
•
Mendez, Vincent Alexandre  
•
Walt, Eliot
Show more
July 2022
IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium
2022 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2022)

Remote sensing visual question answering (RSVQA) opens new avenues to promote the use of satellites data, by interfacing satellite image analysis with natural language processing. Capitalizing on the remarkable advances in natural language processing and computer vision, RSVQA aims at finding an answer to a question formulated by a human user about a remote sensing image. This is achieved by extracting representations from images and questions, and then fusing them in a joint representation. Focusing on the language part of the architecture, this study compares and evaluates the adequacy to the RSVQA task of two language models, a traditional recurrent neural network (Skip-thoughts) and a recent attentionbased Transformer (BERT). We study whether large transformer models are beneficial to the task and whether fine-tuning is needed for these models to perform at their best. Our findings show that the models benefit from fine-tuning language models and that RSVQA with BERT is slightly but consistently better when properly fine-tuned.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/IGARSS46834.2022.9884036
Author(s)
Chappuis, Christel  
Mendez, Vincent Alexandre  
Walt, Eliot
Lobry, Sylvain
Le Saux, Bertrand
Tuia, Devis  
Date Issued

2022-07

Publisher

IEEE

Published in
IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium
ISBN of the book

978-1-665427-92-0

Total of pages

4

Start page

4855

End page

4858

Subjects

Remote Sensing Visual Question Answering

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
ECEO  
Event nameEvent placeEvent date
2022 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2022)

Kuala Lumpur, Malaysia

July 17-22, 2022

Available on Infoscience
January 30, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/194538
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés