Optimal Adversarial Policies in the Multiplicative Learning System With a Malicious Expert

Etesami, S. Rasoul; Kiyavash, Negar; Leon, Vincent; Poor, H. Vincent

doi:10.1109/TIFS.2021.3052360

Etesami, S. Rasoul; Kiyavash, Negar; Leon, Vincent; Poor, H. Vincent

2021

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

We consider a learning system based on the conventional multiplicative weight ( MW) rule that combines experts' advice to predict a sequence of true outcomes. It is assumed that one of the experts is malicious and aims to impose the maximum loss on the system. The system's loss is naturally defined to be the aggregate absolute difference between the sequence of predicted outcomes and the true outcomes. We consider this problem under both offline and online settings. In the offline setting where the malicious expert must choose its entire sequence of decisions a priori, we show somewhat surprisingly that a simple greedy policy of always reporting false prediction is asymptotically optimal with an approximation ratio of 1+ O(root ln N/N), where N is the total number of prediction stages. In particular, we describe a policy that closely resembles the structure of the optimal offline policy. For the online setting where the malicious expert can adaptively make its decisions, we show that the optimal online policy can be efficiently computed by solving a dynamic program in O(N-3). We also discuss a generalization of our model to multi-expert settings. Our results provide a new direction for vulnerability assessment of commonly-used learning algorithms to internal adversarial attacks.

Details

Title Optimal Adversarial Policies in the Multiplicative Learning System With a Malicious Expert

Author(s) Etesami, S. Rasoul ; Kiyavash, Negar ; Leon, Vincent ; Poor, H. Vincent

Published in Ieee Transactions On Information Forensics And Security

Volume 16

Pages 2276-2287

Date 2021-01-01

Publisher Piscataway, IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

ISSN 1556-6013
1556-6021

Keywords

adversarial learning; expert advice; markov decision process; dynamic programming; approximation ratio

DOI https://doi.org/10.1109/TIFS.2021.3052360

Other identifier(s) View record in Web of Science

Laboratories BAN

Record Appears in Scientific production and competences > CDM - College of Management of Technology > MTEI - Management of Technology and Entrepreneurship Institute > BAN - Chair of Business Analytics
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2021-03-26

Abstract

Details

Actions