De novo protein design by inversion of the AlphaFold structure prediction network

Goverde, Casper; Wolf, Benedict; Khakzad, Hamed; Rosset, Stéphane; Correia, Bruno

doi:10.1002/pro.4653

2023

Abstract

De novo protein design enhances our understanding of the principles that govern protein folding and interactions, and has the potential to revolutionize biotechnology through the engineering of novel protein functionalities. Despite recent progress in computational design strategies, de novo design of protein structures remains challenging, given the vast size of the sequence-structure space. AlphaFold2 (AF2), a state-of-the-art neural network architecture, achieved remarkable accuracy in predicting protein structures from amino acid sequences. This raises the question of whether AF2 has learned the principles of protein folding sufficiently for de novo design. Here, we sought to answer this question by inverting the AF2 network, using the prediction weight set and a loss function to bias the generated sequences to adopt a target fold. Initial design trials resulted in de novo designs with an overrepresentation of hydrophobic residues on the protein surface compared to their natural protein family, requiring additional surface optimization. In silico validation of the designs showed protein structures with the correct fold, a hydrophilic surface and a densely packed hydrophobic core. In vitro validation showed that seven out of 39 designs were folded and stable in solution with high melting temperatures. In summary, our design workflow based solely on AF2 does not seem to fully capture basic principles of de novo protein design, as observed in the protein surface's hydrophobic vs. hydrophilic patterning. However, with minimal post-design intervention, these pipelines generated viable sequences, as assessed by experimental characterization. Thus, such pipelines show the potential to contribute to solving outstanding challenges in de novo protein design.
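
The idea of inverting a prediction network can be illustrated with a deliberately tiny sketch: the predictor's weights stay frozen, and gradient descent is applied to the input sequence logits so that the prediction matches a target. This is not the authors' pipeline and does not use AF2. The six-letter alphabet, the `WEIGHTS` scores, and the functions `predict`, `invert`, and `decode` are all hypothetical stand-ins chosen for illustration, and the per-position squared-error loss is an assumption.

```python
import math

# Hypothetical toy alphabet and frozen per-residue scores (illustrative
# values only, not a published hydrophobicity scale and not AF2 weights).
ALPHABET = "AVLDKS"
WEIGHTS = [0.5, 0.8, 1.0, -1.0, -0.9, -0.2]


def softmax(logits):
    """Numerically stable softmax over one position's logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict(probs):
    """Frozen toy 'network': expected residue score at one position."""
    return sum(p * w for p, w in zip(probs, WEIGHTS))


def loss(logits, target):
    """Squared error between predicted and target per-position scores."""
    return sum((predict(softmax(z)) - t) ** 2 for z, t in zip(logits, target))


def invert(target, steps=500, lr=0.5):
    """Gradient descent on the input logits; the predictor is never updated."""
    logits = [[0.0] * len(ALPHABET) for _ in target]
    for _ in range(steps):
        for z, t in zip(logits, target):
            p = softmax(z)
            y = predict(p)
            # Chain rule through the softmax: dL/dz_a = 2 (y - t) p_a (w_a - y)
            grads = [2.0 * (y - t) * p[a] * (WEIGHTS[a] - y) for a in range(len(z))]
            for a, g in enumerate(grads):
                z[a] -= lr * g
    return logits


def decode(logits):
    """Pick the highest-logit residue at each position."""
    return "".join(ALPHABET[max(range(len(z)), key=z.__getitem__)] for z in logits)


if __name__ == "__main__":
    target = [1.0, -1.0, 1.0, -1.0]  # hypothetical alternating target pattern
    optimized = invert(target)
    print(decode(optimized), round(loss(optimized, target), 4))
```

In this toy setting the optimized logits drift toward the residues whose frozen score best matches the target at each position. A real structure-prediction network is far less well behaved, which is consistent with the abstract's observation that post-design surface optimization was needed.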

Details

Title De novo protein design by inversion of the AlphaFold structure prediction network

Author(s) Goverde, Casper ; Wolf, Benedict ; Khakzad, Hamed ; Rosset, Stéphane ; Correia, Bruno

Published in Protein Science

Volume 32

Issue 6

Pages e4653

Date 2023-05-10

Keywords

De novo protein design; AlphaFold2; machine learning; structure prediction network inversion; computational structural biology

DOI https://doi.org/10.1002/pro.4653

Laboratories LPDI

Record Appears in Scientific production and competences > STI - School of Engineering > IBI-STI - Interfaculty Institute of Bioengineering > LPDI - Laboratory of Protein Design and Immunoengineering
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Grant EU funding: 716058

Record creation date 2023-05-22
