Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Get the Best of Both Worlds: Improving Accuracy and Transferability by Grassmann Class Representation
 
conference paper

Get the Best of Both Worlds: Improving Accuracy and Transferability by Grassmann Class Representation

Wang, Haoqi  
•
Li, Zhizhong
•
Zhang, Wayne
January 1, 2023
2023 Ieee/Cvf International Conference On Computer Vision (Iccv 2023)
IEEE/CVF International Conference on Computer Vision (ICCV)

We generalize the class vectors found in neural networks to linear subspaces (i.e., points in the Grassmann manifold) and show that the Grassmann Class Representation (GCR) enables simultaneous improvement in accuracy and feature transferability. In GCR, each class is a subspace, and the logit is defined as the norm of the projection of a feature onto the class subspace. We integrate Riemannian SGD into deep learning frameworks such that class subspaces in a Grassmannian are jointly optimized with the rest model parameters. Compared to the vector form, the representative capability of subspaces is more powerful. We show that on ImageNet-1K, the top-1 errors of ResNet50-D, ResNeXt50, Swin-T, and Deit3-S are reduced by 5.6%, 4.5%, 3.0%, and 3.5%, respectively. Subspaces also provide freedom for features to vary, and we observed that the intra-class feature variability grows when the subspace dimension increases. Consequently, we found the quality of GCR features is better for downstream tasks. For ResNet50-D, the average linear transfer accuracy across 6 datasets improves from 77.98% to 79.70% compared to the strong baseline of vanilla softmax. For Swin-T, it improves from 81.5% to 83.4% and for Deit3, it improves from 73.8% to 81.4%. With these encouraging results, we believe that more applications could benefit from the Grassmann class representation.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICCV51070.2023.02054
Web of Science ID

WOS:001169500507006

Author(s)
Wang, Haoqi  
Li, Zhizhong
Zhang, Wayne
Corporate authors
IEEE
Date Issued

2023-01-01

Publisher

Ieee Computer Soc

Publisher place

Los Alamitos

Published in
2023 Ieee/Cvf International Conference On Computer Vision (Iccv 2023)
ISBN of the book

979-8-3503-0718-4

Start page

22421

End page

22430

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
CVLAB  
Event nameEvent placeEvent date
IEEE/CVF International Conference on Computer Vision (ICCV)

Paris, FRANCE

OCT 02-06, 2023

Available on Infoscience
May 1, 2024
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/207563
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés