Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Preprints and Working Papers
  4. Towards improving full-length ribosome density prediction by bridging sequence and graph-based representations
This is not the latest version of this item. The latest version can be found here.
 
preprint

Towards improving full-length ribosome density prediction by bridging sequence and graph-based representations

Nallapareddy, Mohan Vamsi  
•
Craighero, Francesco  
•
Gobet, Cédric  
Show more
2024

Translation elongation plays an important role in regulating protein concentrations in the cell, and dysregulation of this process has been linked to several human diseases. In this study, we use data from ribo-seq experiments to model ribosome dwell times, and in turn, predict the speed of translation. The proposed method, RiboGL, combines graph and recurrent neural networks to account for both graph and sequence-based features. The model takes a mixed graph representing the secondary structure of the mRNA sequence as input, which incorporates both sequence and structure codon neighbors. In our experiments, RiboGL greatly outperforms the state-of-the-art RiboMIMO model for ribosome density prediction. We also conduct multiple ablation studies to justify the design choices made in building the pipeline. Additionally, we use gradient-based interpretability to understand how the codon context and the structural neighbors affect the ribosome dwell time at the A site. By individually analyzing the genes in the dataset, we elucidate how structure neighbors could also potentially play a role in defining the ribosome dwell times. Importantly, since structure neighbors can be far away in the sequence, a recurrent model alone could not easily extract this information. This study lays the foundation for understanding how the mRNA secondary structure can be exploited for dwell time prediction, and how in the future other graph modalities such as features from the nascent polypeptide can be used to further our understanding.

  • Files
  • Details
  • Version
  • Metrics
Loading...
Thumbnail Image
Name

2024.04.08.588507v1.full.pdf

Type

Preprint

Version

http://purl.org/coar/version/c_71e4c1898caa6e32

Access type

openaccess

License Condition

CC BY-NC

Size

2.19 MB

Format

Adobe PDF

Checksum (MD5)

d8178d83d71599134f47506df066ce6d

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés