CLIP the Gap: A Single Domain Generalization Approach for Object Detection

Vidit, Vidit; Engilberge, Martin; Salzmann, Mathieu

doi:10.1109/CVPR52729.2023.00314

conference paper

CLIP the Gap: A Single Domain Generalization Approach for Object Detection

Vidit, Vidit

•

Engilberge, Martin

•

Salzmann, Mathieu

January 1, 2023

2023 Ieee/Cvf Conference On Computer Vision And Pattern Recognition, Cvpr

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Single Domain Generalization (SDG) tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain. While this has been well studied for image classification, the literature on SDG object detection remains almost non-existent. To address the challenges of simultaneously learning robust object localization and representation, we propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts. We achieve this via a semantic augmentation strategy acting on the features extracted by the detector backbone, as well as a text-based classification loss. Our experiments evidence the benefits of our approach, outperforming by 10% the only existing SDG object detection method, Single-DGOD [52], on their own diverse weather-driving benchmark.

Type

conference paper

DOI

10.1109/CVPR52729.2023.00314

Web of Science ID

WOS:001058542603049

Author(s)

Vidit, Vidit

Engilberge, Martin

Salzmann, Mathieu

Date Issued

2023-01-01

Publisher

Ieee Computer Soc

Publisher place

Los Alamitos

Published in

2023 Ieee/Cvf Conference On Computer Vision And Pattern Recognition, Cvpr

ISBN of the book

979-8-3503-0129-8

Start page

3219

End page

3229

Subjects

Technology

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

CVLAB

Event name	Event place	Event date
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)	Vancouver, CANADA	JUN 17-24, 2023

Funder	Grant Number
Swiss National Science Foundation
Swiss Innovation Agency (Innosuisse) via the BRIDGE Discovery grant	40B2-0 194729

Available on Infoscience

February 16, 2024

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/203771