Constructing a provably adversarially-robust classifier from a high accuracy one

Gluch, Grzegorz; Urbanke, Rudiger

2020

Abstract

Modern machine learning models with very high accuracy have been shown to be vulnerable to small, adversarially chosen perturbations of the input. Given black-box access to a high-accuracy classifier f, we show how to construct a new classifier g that has high accuracy and is also robust to adversarial L2-bounded perturbations. Our algorithm builds upon the framework of randomized smoothing that has been recently shown to outperform all previous defenses against L2-bounded adversaries. Using techniques like random partitions and doubling dimension, we are able to bound the adversarial error of g in terms of the optimum error. In this paper we focus on our conceptual contribution, but we do present two examples to illustrate our framework. We will argue that, under some assumptions, our bounds are optimal for these cases.
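To make the construction mentioned in the abstract more concrete, the following is a minimal sketch of generic randomized smoothing: a smoothed classifier g is obtained from black-box access to a base classifier f by taking a majority vote of f over Gaussian perturbations of the input. This illustrates only the general framework the paper builds upon, not the authors' algorithm (which additionally uses random partitions and doubling dimension). The names `smooth_predict` and `toy_f`, and the parameters `sigma`, `n_samples`, `n_classes`, and `seed`, are assumptions introduced for illustration.

```python
import numpy as np


def smooth_predict(f, x, sigma=0.25, n_samples=1000, n_classes=2, seed=0):
    """Monte Carlo estimate of a smoothed classifier g at the point x.

    f is treated as a black box mapping a 1-D array to an integer label
    in {0, ..., n_classes - 1}. All parameter names are hypothetical.
    """
    rng = np.random.default_rng(seed)
    # Draw isotropic Gaussian noise with standard deviation sigma.
    noise = rng.normal(0.0, sigma, size=(n_samples, x.shape[0]))
    votes = np.zeros(n_classes, dtype=int)
    for z in x + noise:
        votes[f(z)] += 1  # query the black-box classifier on a noisy copy
    # g(x) is the most frequent label; also return empirical frequencies.
    return int(np.argmax(votes)), votes / n_samples


def toy_f(z):
    """Hypothetical base classifier: a linear threshold on the first coordinate."""
    return int(z[0] > 0.0)


label_pos, freqs_pos = smooth_predict(toy_f, np.array([1.0, 0.0]))
label_neg, freqs_neg = smooth_predict(toy_f, np.array([-1.0, 0.0]))
```

In this sketch, a larger `sigma` tolerates larger L2 perturbations but blurs the decision boundary of f, which is the accuracy versus robustness trade-off that randomized smoothing has to balance.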

Details

Title Constructing a provably adversarially-robust classifier from a high accuracy one

Author(s) Gluch, Grzegorz ; Urbanke, Rudiger

Published in International Conference on Artificial Intelligence and Statistics, Vol 108

Series Proceedings of Machine Learning Research

Volume 108

Pages 3674-3683

Conference 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), held online, Aug 26-28, 2020

Date 2020-01-01

Publisher Boston, ADDISON-WESLEY PUBL CO

ISSN 2640-3498

Laboratories LTHC
THL4

Record appears in THL4 - Theory of Computation Laboratory 4
LTHC - Communication Theories Laboratory
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2020-10-25