Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Exploring “dark-matter” protein folds using deep learning
 
research article

Exploring “dark-matter” protein folds using deep learning

Harteveld, Zander  
•
Van Hall-Beauvais, Alexandra  
•
Morozova, Irina  
Show more
October 16, 2024
Cell Systems

De novo protein design explores uncharted sequence and structure space to generate novel proteins not sampled by evolution. A main challenge in de novo design involves crafting “designable” structural templates to guide the sequence searches toward adopting target structures. We present a convolutional variational autoencoder that learns patterns of protein structure, dubbed Genesis. We coupled Genesis with trRosetta to design sequences for a set of protein folds and found that Genesis is capable of reconstructing native-like distance and angle distributions for five native folds and three novel, the so-called “dark-matter” folds as a demonstration of generalizability. We used a high-throughput assay to characterize the stability of the designs through protease resistance, obtaining encouraging success rates for folded proteins. Genesis enables exploration of the protein fold space within minutes, unrestricted by protein topologies. Our approach addresses the backbone designability problem, showing that small neural networks can efficiently learn structural patterns in proteins. A record of this paper's transparent peer review process is included in the supplemental information.

  • Files
  • Details
  • Metrics
Type
research article
DOI
10.1016/j.cels.2024.09.006
Scopus ID

2-s2.0-85206959473

PubMed ID

39383860

Author(s)
Harteveld, Zander  

École Polytechnique Fédérale de Lausanne

Van Hall-Beauvais, Alexandra  

École Polytechnique Fédérale de Lausanne

Morozova, Irina  

École Polytechnique Fédérale de Lausanne

Southern, Joshua

Imperial College London

Goverde, Casper  

École Polytechnique Fédérale de Lausanne

Georgeon, Sandrine  

École Polytechnique Fédérale de Lausanne

Rosset, Stéphane  

École Polytechnique Fédérale de Lausanne

Defferrard, Michëal

École Polytechnique Fédérale de Lausanne

Loukas, Andreas  

École Polytechnique Fédérale de Lausanne

Vandergheynst, Pierre  

École Polytechnique Fédérale de Lausanne

Show more
Date Issued

2024-10-16

Published in
Cell Systems
Volume

15

Issue

10

Start page

898

End page

910.e5

Subjects

computational protein design

•

dark-matter folds

•

de novo design

•

high-throughput screening

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LPDI  
LTS2  
FunderFunding(s)Grant NumberGrant URL

Biltema Foundation

National Center of Competence in Research in Chemical Biology

Swiss National Supercomputing Centre

Show more
Available on Infoscience
January 25, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/243939
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés