Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Preprints and Working Papers
  4. Logit Pairing Methods Can Fool Gradient-Based Attacks
 
preprint

Logit Pairing Methods Can Fool Gradient-Based Attacks

Mosbach, Marius
•
Andriushchenko, Maksym  
•
Trost, Thomas
Show more
March 12, 2019

Recently, Kannan et al. [2018] proposed several logit regularization methods to improve the adversarial robustness of classifiers. We show that the computationally fast methods they propose - Clean Logit Pairing (CLP) and Logit Squeezing (LSQ) - just make the gradient-based optimization problem of crafting adversarial examples harder without providing actual robustness. We find that Adversarial Logit Pairing (ALP) may indeed provide robustness against adversarial examples, especially when combined with adversarial training, and we examine it in a variety of settings. However, the increase in adversarial accuracy is much smaller than previously claimed. Finally, our results suggest that the evaluation against an iterative PGD attack relies heavily on the parameters used and may result in false conclusions regarding robustness of a model.

  • Details
  • Metrics
Type
preprint
ArXiv ID

1810.12042

Author(s)
Mosbach, Marius
Andriushchenko, Maksym  
Trost, Thomas
Hein, Matthias
Klakow, Dietrich
Date Issued

2019-03-12

Editorial or Peer reviewed

NON-REVIEWED

Written at

OTHER

EPFL units
IINFCOM  
Available on Infoscience
December 6, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/163807
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés