Revisiting adversarial training for the worst-performing class

Pethick, Thomas Michaelsen; Chrysos, Grigorios; Cevher, Volkan

research article

Pethick, Thomas Michaelsen

•

Chrysos, Grigorios

•

Cevher, Volkan

2023

Transactions on Machine Learning Research

Despite progress in adversarial training (AT), there is a substantial gap between the topperforming and worst-performing classes in many datasets. For example, on CIFAR10, the accuracies for the best and worst classes are 74% and 23%, respectively. We argue that this gap can be reduced by explicitly optimizing for the worst-performing class, resulting in a min-max-max optimization formulation. Our method, called class focused online learning (CFOL), includes high probability convergence guarantees for the worst class loss and can be easily integrated into existing training setups with minimal computational overhead. We demonstrate an improvement to 32% in the worst class accuracy on CIFAR10, and we observe consistent behavior across CIFAR100 and STL10. Our study highlights the importance of moving beyond average accuracy, which is particularly important in safetycritical applications.

Name

439_revisiting_adversarial_trainin.pdf

Type

Postprint

Version

Accepted version

Access type

openaccess

License Condition

copyright

Size

866.68 KB

Format

Adobe PDF

Checksum (MD5)

9fac8dfcaf8bfa2459915683c9360892