Infoscience

research article

Fine-tuning protein language models to understand the functional impact of missense variants

Saadat, Ali; Fellay, Jacques
January 1, 2025
Computational and Structural Biotechnology Journal

Elucidating the functional effects of missense variants is crucial yet challenging. To investigate their impact, we fine-tuned protein language models, including ESM2 and ProtT5, to classify 20 protein features at amino acid resolution. In addition, we trained a fully connected neural network classifier on frozen embeddings and compared its performance to fine-tuning in order to quantify the added value of task-specific adaptation. We then used the fine-tuned models to: 1) identify protein features enriched in either pathogenic or benign missense variants, and 2) compare the predicted feature profiles of proteins with reference and alternate alleles to understand how missense variants affect protein functionality. We show that our models can be used to reclassify variants of uncertain significance and provide mechanistic insights into the functional consequences of missense mutations.
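The second use case in the abstract, comparing predicted feature profiles for the reference and alternate alleles, can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the article: the helper names `apply_missense` and `profile_delta`, the feature label `disulfide_bond`, the toy predictor standing in for a fine-tuned model, and the choice of the largest absolute per-residue change as the summary statistic.

```python
from typing import Callable, Dict, List

# One dict per residue, mapping a feature name to a predicted probability.
Profile = List[Dict[str, float]]


def apply_missense(seq: str, pos: int, ref: str, alt: str) -> str:
    """Return seq with the residue at 1-based position pos changed from ref to alt."""
    if not 1 <= pos <= len(seq):
        raise ValueError("position outside sequence")
    if seq[pos - 1] != ref:
        raise ValueError("reference residue does not match sequence")
    return seq[: pos - 1] + alt + seq[pos:]


def profile_delta(
    predict: Callable[[str], Profile], seq: str, pos: int, ref: str, alt: str
) -> Dict[str, float]:
    """Largest absolute per-residue probability change for each feature.

    This summary statistic is an illustrative choice, not the article's metric.
    """
    ref_profile = predict(seq)
    alt_profile = predict(apply_missense(seq, pos, ref, alt))
    delta: Dict[str, float] = {}
    for ref_res, alt_res in zip(ref_profile, alt_profile):
        for feature, p_ref in ref_res.items():
            change = abs(alt_res[feature] - p_ref)
            delta[feature] = max(delta.get(feature, 0.0), change)
    return delta


def toy_predict(seq: str) -> Profile:
    """Hypothetical stand-in for a fine-tuned model: marks every cysteine."""
    return [{"disulfide_bond": 1.0 if aa == "C" else 0.0} for aa in seq]


if __name__ == "__main__":
    # Toy sequence and variant (Cys3 to Ser), invented for the example.
    print(profile_delta(toy_predict, "MKCA", 3, "C", "S"))
```

In practice `predict` would wrap a per-residue classifier such as the fine-tuned ESM2 or ProtT5 models described above; the toy predictor only keeps the sketch runnable without model weights.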

Name: 10.1016_j.csbj.2025.05.022.pdf
Type: Main Document
Version: Published version
Access type: openaccess
License Condition: CC BY
Size: 2.35 MB
Format: Adobe PDF
Checksum (MD5): 01a10ad2f0f26d81e039a971a8fb5fde


Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved