Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy

Zhang, Tong; Qiu, Congpei; Ke, Wei; Süsstrunk, Sabine; Salzmann, Mathieu

doi:10.1109/CVPR52688.2022.01608

2022

Abstract

Self-supervised learning (SSL) methods aim to learn view-invariant representations by maximizing the similarity between the features extracted from different crops of the same image, regardless of cropping size and content. In essence, this strategy ignores the fact that two crops may truly contain different image information, e.g., background and small objects, and thus tends to restrain the diversity of the learned representations. In this work, we address this issue by introducing a new self-supervised learning strategy, LoGo, that explicitly reasons about Local and Global crops. To achieve view invariance, LoGo encourages similarity between global crops from the same image, as well as between a global and a local crop. However, to correctly encode the fact that the content of smaller crops may differ entirely, LoGo encourages two local crops to have dissimilar representations while each remains close to the global crops. Our LoGo strategy can easily be applied to existing SSL methods. Our extensive experiments on a variety of datasets and using different self-supervised learning frameworks validate its superiority over existing approaches. Notably, we achieve better results than supervised models on transfer learning when using only 1/10 of the data.
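
The three relations described in the abstract (global-global similarity, global-local similarity, local-local dissimilarity) can be illustrated with a toy objective. This is only a sketch under stated assumptions, not the paper's loss: plain cosine similarity stands in for whatever similarity measure the authors use, and the names `cosine`, `logo_style_loss`, and `local_weight` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def logo_style_loss(g1, g2, l1, l2, local_weight=1.0):
    """Toy objective (lower is better); an illustrative assumption only.

    g1, g2: features of two global crops of one image.
    l1, l2: features of two local crops of the same image.
    """
    # Pull the two global crops together.
    global_global = 1.0 - cosine(g1, g2)
    # Pull every local crop toward every global crop (averaged over pairs).
    pairs = [(g, l) for g in (g1, g2) for l in (l1, l2)]
    global_local = sum(1.0 - cosine(g, l) for g, l in pairs) / len(pairs)
    # Push the two local crops apart by penalizing their similarity.
    local_local = cosine(l1, l2)
    return global_global + global_local + local_weight * local_local


g = [1.0, 0.0]
loss_distinct_locals = logo_style_loss(g, g, [1.0, 1.0], [1.0, -1.0])
loss_identical_locals = logo_style_loss(g, g, [1.0, 1.0], [1.0, 1.0])
```

In this toy setting both local features are equally close to the global feature, so the only difference between the two calls is the local-local term: the configuration with distinct local features scores lower than the one with identical local features.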

Details

Title Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy

Author(s) Zhang, Tong ; Qiu, Congpei ; Ke, Wei ; Süsstrunk, Sabine ; Salzmann, Mathieu

Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022)

Series IEEE Conference on Computer Vision and Pattern Recognition

Pages 16559-16568

Conference IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, June 18-24, 2022

Date 2022-06-17

Publisher Los Alamitos, IEEE Computer Society

ISBN 978-1-6654-6946-3

DOI https://doi.org/10.1109/CVPR52688.2022.01608

Laboratories IVRL
CVLAB

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > IVRL - Image and Visual Representation Laboratory
Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > CVLAB - Computer Vision Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Conference Papers
Work produced at EPFL

Record creation date 2022-12-12
