Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. The shape of cancer relapse: Topological data analysis predicts recurrence in paediatric acute lymphoblastic leukaemia
 
research article

The shape of cancer relapse: Topological data analysis predicts recurrence in paediatric acute lymphoblastic leukaemia

Chulian, Salvador
•
Stolz, Bernadette  
•
Martinez-Rubio, Alvaro
Show more
August 1, 2023
Plos Computational Biology

Author summaryAcute lymphoblastic leukaemia (ALL) is a blood cancer which affects predominantly children and adolescents. Therapy typically fails in approximately 20% of patients, who suffer from relapse. To determine disease status, clinicians assess cell types, their interactions, as well as deviations from normal behaviour. Flow cytometry (FC) is a method that quantifies the intensity of specific cell markers, giving rise to high-dimensional data. This routinely collected information is then reduced to obtain human-interpretable visualisation for prognosis. Topological Data Analysis (TDA) is a field of mathematics that studies shapes in data, considering isolated data islands and empty spaces between them. We showcase how to use TDA to extract shape characteristics in FC data of relapsing patients. We propose three pipelines, of increasing methodological complexity, to aid clinical decisions for risk stratification in ALL. In combination with Machine Learning, TDA enables high-accuracy predictions of relapse to be made at the time of diagnosis. Although children and adolescents with acute lymphoblastic leukaemia (ALL) have high survival rates, approximately 15-20% of patients relapse. Risk of relapse is routinely estimated at diagnosis by biological factors, including flow cytometry data. This high-dimensional data is typically manually assessed by projecting it onto a subset of biomarkers. Cell density and "empty spaces" in 2D projections of the data, i.e. regions devoid of cells, are then used for qualitative assessment. Here, we use topological data analysis (TDA), which quantifies shapes, including empty spaces, in data, to analyse pre-treatment ALL datasets with known patient outcomes. We combine these fully unsupervised analyses with Machine Learning (ML) to identify significant shape characteristics and demonstrate that they accurately predict risk of relapse, particularly for patients previously classified as 'low risk'. We independently confirm the predictive power of CD10, CD20, CD38, and CD45 as biomarkers for ALL diagnosis. Based on our analyses, we propose three increasingly detailed prognostic pipelines for analysing flow cytometry data from ALL patients depending on technical and technological availability: 1. Visual inspection of specific biological features in biparametric projections of the data; 2. Computation of quantitative topological descriptors of such projections; 3. A combined analysis, using TDA and ML, in the four-parameter space defined by CD10, CD20, CD38 and CD45. Our analyses readily extend to other haematological malignancies.

  • Details
  • Metrics
Type
research article
DOI
10.1371/journal.pcbi.1011329
Web of Science ID

WOS:001050842200007

Author(s)
Chulian, Salvador
Stolz, Bernadette  
Martinez-Rubio, Alvaro
Goni, Cristina Blazquez
Gutierrez, Juan Rodriguez
Velazquez, Teresa Caballero
Quintana, Agueda Molinos
Orellana, Manuel Ramirez
Robleda, Ana Castillo
Soler, Jose Luis Fuster M.
Show more
Date Issued

2023-08-01

Published in
Plos Computational Biology
Volume

19

Issue

8

Article Number

e1011329

Subjects

Biochemical Research Methods

•

Mathematical & Computational Biology

•

Biochemistry & Molecular Biology

•

Mathematical & Computational Biology

•

minimal residual disease

•

flow-cytometry

•

persistent homology

•

bone-marrow

•

classification

•

differentiation

•

diagnosis

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

Available on Infoscience
September 11, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/200617
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés