A Rosetta-based protein design protocol converging to natural sequences

Sormani, Giulia; Harteveld, Zander; Rosset, Stephane; Correia, Bruno; Laio, Alessandro

doi:10.1063/5.0039240

Sormani, Giulia; Harteveld, Zander; Rosset, Stephane; Correia, Bruno; Laio, Alessandro

2021

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Computational protein design has emerged as a powerful tool capable of identifying sequences compatible with pre-defined protein structures. The sequence design protocols, implemented in the Rosetta suite, have become widely used in the protein engineering community. To understand the strengths and limitations of the Rosetta design framework, we tested several design protocols on two distinct folds (SH3-1 and Ubiquitin). The sequence optimization, when started from native structures and natural sequences or polyvaline sequences, converges to sequences that are not recognized as belonging to the fold family of the target protein by standard bioinformatic tools, such as BLAST and Hmmer. The sequences generated from both starting conditions (native and polyvaline) are instead very similar to each other and recognized by Hmmer as belonging to the same "family." This demonstrates the capability of Rosetta to converge to similar sequences, even when sampling from distinct starting conditions, but, on the other hand, shows intrinsic inaccuracy of the scoring function that drifts toward sequences that lack identifiable natural sequence signatures. To address this problem, we developed a protocol embedding Rosetta Design simulations in a genetic algorithm, in which the sequence search is biased to converge to sequences that exist in nature. This protocol allows us to obtain sequences that have recognizable natural sequence signatures and, experimentally, the designed proteins are biochemically well behaved and thermodynamically stable.

Details

Title A Rosetta-based protein design protocol converging to natural sequences

Author(s) Sormani, Giulia ; Harteveld, Zander ; Rosset, Stephane ; Correia, Bruno ; Laio, Alessandro

Published in Journal Of Chemical Physics

Volume 154

Issue 7

Pages 074114

Date 2021-02-21

Publisher Melville, AMER INST PHYSICS

ISSN 0021-9606
1089-7690

DOI https://doi.org/10.1063/5.0039240

Other identifier(s) View record in Web of Science

Laboratories LPDI

Record Appears in Scientific production and competences > STI - School of Engineering > IBI-STI - Interfaculty Institute of Bioengineering > LPDI - Laboratory of Protein Design and Immunoengineering
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2021-03-26