Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Text as a Richer Source of Supervision in Semantic Segmentation Tasks
 
conference paper

Text as a Richer Source of Supervision in Semantic Segmentation Tasks

Zermatten, Valérie  
•
Castillo Navarro, Javiera  
•
Hughes, Lloyd  
Show more
2023
IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium. Proceedings
International Geoscience and Remote Sensing Symposium (IGARSS)

This paper introduces TACOSS a text-image alignment approach that allows explainable land cover semantic segmentation by directly integrating semantic concepts encoded from texts. TACOSS combines convolutional neural networks for visual feature extraction with semantic embeddings provided by a language model. By leveraging contrastive learning approaches, we learn an alignment between the visual and the (fixed) textual representations. In addition to producing standard semantic segmentation outputs, our model enables interactive queries with RS images using natural language prompts. The experimental results obtained on 50cm resolution aerial data from Switzerland show that TACOSS performs similarly to a standard semantic segmentation model while allowing the flexible usage of in- and out-of-vocabulary terms for the interactions with the image.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

[IGRSS]Zermatten_Unformated.pdf

Type

Preprint

Version

http://purl.org/coar/version/c_71e4c1898caa6e32

Access type

openaccess

License Condition

CC BY

Size

14.82 MB

Format

Adobe PDF

Checksum (MD5)

abc874fd2e61ff76188514b2618074ce

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés