Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. CLIP the Gap: A Single Domain Generalization Approach for Object Detection
 
conference paper

CLIP the Gap: A Single Domain Generalization Approach for Object Detection

Vidit, Vidit
•
Engilberge, Martin  
•
Salzmann, Mathieu  
January 1, 2023
2023 Ieee/Cvf Conference On Computer Vision And Pattern Recognition, Cvpr
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Single Domain Generalization (SDG) tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain. While this has been well studied for image classification, the literature on SDG object detection remains almost non-existent. To address the challenges of simultaneously learning robust object localization and representation, we propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts. We achieve this via a semantic augmentation strategy acting on the features extracted by the detector backbone, as well as a text-based classification loss. Our experiments evidence the benefits of our approach, outperforming by 10% the only existing SDG object detection method, Single-DGOD [52], on their own diverse weather-driving benchmark.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/CVPR52729.2023.00314
Web of Science ID

WOS:001058542603049

Author(s)
Vidit, Vidit
•
Engilberge, Martin  
•
Salzmann, Mathieu  
Date Issued

2023-01-01

Publisher

Ieee Computer Soc

Publisher place

Los Alamitos

Published in
2023 Ieee/Cvf Conference On Computer Vision And Pattern Recognition, Cvpr
ISBN of the book

979-8-3503-0129-8

Start page

3219

End page

3229

Subjects

Technology

Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CVLAB  
Event nameEvent placeEvent date
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Vancouver, CANADA

JUN 17-24, 2023

FunderGrant Number

Swiss National Science Foundation

Swiss Innovation Agency (Innosuisse) via the BRIDGE Discovery grant

40B2-0 194729

Available on Infoscience
February 16, 2024
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/203771
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés