Towards Robust Fine-grained Recognition by Maximal Separation of Discriminative Features
Adversarial attacks have been widely studied for general classification tasks, but remain unexplored in the context of fine-grained recognition, where the inter-class similarities facilitate the attacker's task. In this paper, we identify the proximity of the latent representations of local regions of different classes in fine-grained recognition networks as a key factor to the success of adversarial attacks. We therefore introduce an attention-based regularization mechanism that maximally separates the latent features of discriminative regions of different classes while minimizing the contribution of the non-discriminative regions to the final class prediction. As evidenced by our experiments, this allows us to significantly improve robustness to adversarial attacks, to the point of matching or even surpassing that of adversarial training, but without requiring access to adversarial samples. Further, our formulation also improves detection AUROC of adversarial samples over baselines on adversarially trained models.
WOS:001500994800024
École Polytechnique Fédérale de Lausanne
École Polytechnique Fédérale de Lausanne
2021-02-26
Cham
978-3-030-69543-9
978-3-030-69544-6
Part VI
Lecture Notes in Computer Science; 12627
0302-9743
391
408
REVIEWED
EPFL
| Event name | Event acronym | Event place | Event date |
ACCV 2020 | Kyoto, Japan | 2020-11-30 - 2020-12-04 | |