Abstract

The present invention concerns computer-implemented methods for training a machine learning model using Stochastic Gradient Descent (SGD). In one embodiment, the method is performed by a first computer in a distributed computing environment and comprises performing a learning round, which comprises broadcasting a parameter vector to a plurality of worker computers in the distributed computing environment and, upon receipt of one or more respective estimate vectors from a subset of the worker computers, determining an updated parameter vector for use in a next learning round based on the one or more received estimate vectors. The determining comprises ignoring an estimate vector received from a given worker computer when a sending frequency of the given worker computer is above a threshold value. The method aggregates the gradients in an asynchronous communication model with unbounded communication delays.
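
For illustration, the following is a minimal Python sketch of the server-side logic described above, i.e. the "first computer" that aggregates estimate vectors and filters over-frequent workers. Everything here is an illustrative assumption rather than the patented implementation: the class and parameter names (Server, Estimate, freq_threshold, learning_rate) are invented, sending frequency is modelled as messages received per completed round, and aggregation is a plain average of the kept gradients.

    import collections

    import numpy as np


    class Estimate:
        """A gradient estimate vector sent by one worker (hypothetical container)."""

        def __init__(self, worker_id, gradient):
            self.worker_id = worker_id
            self.gradient = gradient


    class Server:
        """Sketch of the first computer running the learning rounds."""

        def __init__(self, dim, freq_threshold, learning_rate=0.1):
            self.params = np.zeros(dim)           # parameter vector broadcast each round
            self.freq_threshold = freq_threshold  # assumed cut-off: messages per round
            self.learning_rate = learning_rate
            self.msg_counts = collections.defaultdict(int)  # messages seen per worker
            self.rounds_done = 0

        def sending_frequency(self, worker_id):
            # Assumed definition: messages received from the worker divided by
            # the number of completed rounds; the abstract leaves this open.
            return self.msg_counts[worker_id] / max(1, self.rounds_done)

        def learning_round(self, received_estimates):
            """One round: aggregate estimates, ignoring over-frequent workers."""
            self.rounds_done += 1
            kept = []
            for est in received_estimates:
                self.msg_counts[est.worker_id] += 1
                if self.sending_frequency(est.worker_id) > self.freq_threshold:
                    continue  # ignore: this worker is sending above the threshold
                kept.append(est.gradient)
            if kept:  # a subset of workers may have responded; average what was kept
                self.params -= self.learning_rate * np.mean(kept, axis=0)
            return self.params  # broadcast as the parameter vector of the next round


    # Toy usage: worker w1 behaves, worker w2 floods the server and is ignored.
    server = Server(dim=2, freq_threshold=1.5)
    for _ in range(5):
        estimates = [Estimate("w1", np.ones(2))]
        estimates += [Estimate("w2", 10 * np.ones(2)) for _ in range(3)]
        server.learning_round(estimates)
    print(server.params)  # shaped mainly by w1's unit gradients, not w2's floods

In a real asynchronous deployment, received_estimates would be drained from a message queue subject to unbounded delays; the frequency filter is what keeps a fast or misbehaving worker from dominating the aggregate.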
