Constructing a provably adversarially-robust classifier from a high accuracy one

Gluch, Grzegorz; Urbanke, Rudiger

conference paper

Gluch, Grzegorz

•

Urbanke, Rudiger

January 1, 2020

International Conference On Artificial Intelligence And Statistics, Vol 108

23rd International Conference on Artificial Intelligence and Statistics (AISTATS)

Modern machine learning models with very high accuracy have been shown to be vulnerable to small, adversarially chosen perturbations of the input. Given black-box access to a high-accuracy classifier f, we show how to construct a new classifier g that has high accuracy and is also robust to adversarial L2-bounded perturbations. Our algorithm builds upon the framework of randomized smoothing that has been recently shown to outperform all previous defenses against L2-bounded adversaries. Using techniques like random partitions and doubling dimension, we are able to bound the adversarial error of g in terms of the optimum error. In this paper we focus on our conceptual contribution, but we do present two examples to illustrate our framework. We will argue that, under some assumptions, our bounds are optimal for these cases.

Type

conference paper

Web of Science ID

WOS:000559931301021

Author(s)

Gluch, Grzegorz

Urbanke, Rudiger

Date Issued

2020-01-01

Publisher

ADDISON-WESLEY PUBL CO

Publisher place

Boston

Published in

International Conference On Artificial Intelligence And Statistics, Vol 108

Series title/Series vol.

Proceedings of Machine Learning Research

Volume

108

Start page

3674

End page

3683

Subjects

Computer Science, Artificial Intelligence

•

Statistics & Probability

•

Computer Science

•

Mathematics

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

LTHC

THL4

Event name	Event place	Event date
23rd International Conference on Artificial Intelligence and Statistics (AISTATS)	ELECTR NETWORK	Aug 26-28, 2020

Available on Infoscience

October 25, 2020

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/172724