Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Drynx: Decentralized, Secure, Verifiable System for Statistical Queries and Machine Learning on Distributed Datasets
 
research article

Drynx: Decentralized, Secure, Verifiable System for Statistical Queries and Machine Learning on Distributed Datasets

Froelicher, David  
•
Troncoso-Pastoriza, Juan Ramon  
•
Sousa, Joao Sa  
Show more
January 1, 2020
Ieee Transactions On Information Forensics And Security

Data sharing has become of primary importance in many domains such as big-data analytics, economics and medical research, but remains difficult to achieve when the data are sensitive. In fact, sharing personal information requires individuals' unconditional consent or is often simply forbidden for privacy and security reasons. In this paper, we propose Drynx, a decentralized system for privacy-conscious statistical analysis on distributed datasets. Drynx relies on a set of computing nodes to enable the computation of statistics such as standard deviation or extrema, and the training and evaluation of machine-learning models on sensitive and distributed data. To ensure data confidentiality and the privacy of the data providers, Drynx combines interactive protocols, homomorphic encryption, zero-knowledge proofs of correctness, and differential privacy. It enables an efficient and decentralized verification of the input data and of all the system's computations thus provides auditability in a strong adversarial model in which no entity has to be individually trusted. Drynx is highly modular, dynamic and parallelizable. Our evaluation shows that it enables the training of a logistic regression model on a dataset (12 features and 600,000 records) distributed among 12 data providers in less than 2 seconds. The computations are distributed among 6 computing nodes, and Drynx enables the verification of the query execution's correctness in less than 22 seconds.

  • Details
  • Metrics
Type
research article
DOI
10.1109/TIFS.2020.2976612
Web of Science ID

WOS:000527936100001

Author(s)
Froelicher, David  
Troncoso-Pastoriza, Juan Ramon  
Sousa, Joao Sa  
Hubaux, Jean-Pierre  
Date Issued

2020-01-01

Published in
Ieee Transactions On Information Forensics And Security
Volume

15

Start page

3035

End page

3050

Subjects

Computer Science, Theory & Methods

•

Engineering, Electrical & Electronic

•

Computer Science

•

Engineering

•

computational modeling

•

machine learning

•

encryption

•

privacy

•

decentralized system

•

distributed datasets

•

statistics

•

homomorphic encryption

•

zero-knowledge proofs

•

differential privacy

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LDS  
DEDIS  
Available on Infoscience
May 7, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/168610
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés