Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Preprints and Working Papers
  4. Secure and Federated Genome-Wide Association Studies for Biobank-Scale Datasets
 
preprint

Secure and Federated Genome-Wide Association Studies for Biobank-Scale Datasets

Cho, Hyunghoon
•
Froelicher, David  
•
Chen, Jeffrey
Show more
2022

Sharing data across multiple institutions for genome-wide association studies (GWAS) would enable discovery of novel genetic variants linked to health and disease. However, existing regulations on genomic data sharing and the sheer size of the data limit the scope of such collaborations. Although cryptographic tools for secure computation promise to enable collaborative studies with formal privacy guarantees, existing approaches either are computationally impractical or support only simplified analysis pipelines. Here, we introduce secure and federated genome-wide association studies (SF-GWAS), a novel combination of secure computation frameworks that empowers efficient and accurate GWAS in a federated manner, i.e., on private data locally-held by multiple entities, while provably ensuring end-to-end data confidentiality. Another key advance is that we designed SF-GWAS to support the two most widely-used GWAS pipelines—those based on principal component analysis (PCA) or linear mixed models (LMMs). We ran SF-GWAS on five real GWAS datasets, including a large UK Biobank cohort of 410K individuals, thereby demonstrating the largest secure genetics collaboration to date. SF-GWAS achieves an order-of-magnitude runtime improvement over the prior art for PCA-based GWAS and newly allows secure LMM-based association tests, for which its runtime scales at a near-constant rate in cohort size. Our work realizes the power of secure, collaborative, and accurate GWAS at unprecedented scale and should be applicable to a broad range of analyses. Our open-source software is at: https://github.com/hhcho/sfgwas.

  • Details
  • Metrics
Type
preprint
DOI
10.1101/2022.11.30.518537
Author(s)
Cho, Hyunghoon
Froelicher, David  
Chen, Jeffrey
Edupalli, Manaswitha
Pyrgelis, Apostolos  
Troncoso-Pastoriza, Juan R.  
Hubaux, Jean-Pierre  
Berger, Bonnie
Date Issued

2022

Editorial or Peer reviewed

NON-REVIEWED

Written at

EPFL

EPFL units
SPRING  
Available on Infoscience
December 19, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/193431
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés