Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains
 
conference paper

AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains

Lis, Krzysztof  
•
Rottmann, Matthias  
•
Mütze, Annika
Show more
Magnenat Thalmann, Nadia
•
Hu, Xinrong
Show more
2024
35th British Machine Vision Conference, BMVC 2024
35 British Machine Vision Conference

In addition to impressive performance, vision transformers have demonstrated remarkable abilities to encode information they were not trained to extract. For example, this information can be used to perform segmentation or single-view depth estimation even though the networks were only trained for image recognition. We show that a similar phenomenon occurs when explicitly training transformers for semantic segmentation in a supervised manner for a set of categories: Once trained, they provide valuable information even about categories absent from the training set. This information can be used to segment objects from these never-seen-before classes in domains as varied as road obstacles, aircraft parked at a terminal, lunar rocks, and maritime hazards.

  • Files
  • Details
  • Metrics
Type
conference paper
Scopus ID

2-s2.0-105029687549

Author(s)
Lis, Krzysztof  

École Polytechnique Fédérale de Lausanne

Rottmann, Matthias  

École Polytechnique Fédérale de Lausanne

Mütze, Annika

Bergische Universität Wuppertal

Honari, Sina

Samsung AI Center Toronto

Fua, Pascal  

École Polytechnique Fédérale de Lausanne

Salzmann, Mathieu  

École Polytechnique Fédérale de Lausanne

Editors
Magnenat Thalmann, Nadia
•
Hu, Xinrong
•
Sheng, Bin
•
Thalmann, Daniel
•
Peng, Tao
•
Meng, Weiliang
•
Huang, Jin
•
Zhu, Lei
•
Wei, Xiong
Date Issued

2024

Publisher

British Machine Vision Association, BMVA

Published in
35th British Machine Vision Conference, BMVC 2024
Series title/Series vol.

Communications in Computer and Information Science; 2375 CCIS

ISSN (of the series)

1865-0937

1865-0929

Published in
Transactions on Machine Learning Research
Volume

2026-January

Start page

1

End page

37

URL

Online Proceedings

https://bmvc2024.org/proceedings/conference-proceedings/
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CVLAB  
Event nameEvent acronymEvent placeEvent date
35 British Machine Vision Conference

BMVC 2024

Glasgow, UK

2024-11-25 - 2024-11-28

Available on Infoscience
February 17, 2026
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/259532
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés