Abstract

The optimization of therapeutic antibodies is time-intensive and resource-demanding, largely because of the low-throughput screening of full-length antibodies (approximately 1 x 10(3) variants) expressed in mammalian cells, which typically results in few optimized leads. Here we show that optimized antibody variants can be identified by predicting antigen specificity via deep learning from a massively diverse space of antibody sequences. To produce data for training deep neural networks, we deep-sequenced libraries of the therapeutic antibody trastuzumab (about 1 x 10(4) variants), expressed in a mammalian cell line through site-directed mutagenesis via CRISPR-Cas9-mediated homology-directed repair, and screened the libraries for specificity to human epidermal growth factor receptor 2 (HER2). We then used the trained neural networks to screen a computational library of approximately 1 x 10(8) trastuzumab variants and predict the HER2-specific subset (approximately 1 x 10(6) variants), which can then be filtered for viscosity, clearance, solubility and immunogenicity to generate thousands of highly optimized lead candidates. Recombinant expression and experimental testing of 30 randomly selected variants from the unfiltered library showed that all 30 retained specificity for HER2. Deep learning may facilitate antibody engineering and optimization.

Therapeutic antibodies can be optimized using deep-learning models trained on antibody-mutagenesis libraries to generate antibody variants and predict their antigen specificity.

Details

Actions