research article

Two-Tier Mapper, an unbiased topology-based clustering method for enhanced global gene expression analysis

Jeitziner, Rachel • Carrière, Mathieu • Rougemont, Jacques
February 7, 2019
Bioinformatics

Motivation: Unbiased clustering methods are needed to analyze growing numbers of complex data sets. Currently available clustering methods often depend on user-set parameters, lack stability, and are not applicable to small data sets. To overcome these shortcomings, we used topological data analysis, an emerging field of mathematics that can discern additional features and uncover hidden insights in data sets and has a wide range of applications.

Results: We have developed a topology-based clustering method called Two-Tier Mapper (TTMap) for enhanced analysis of global gene expression datasets. First, TTMap discerns divergent features in the control group, adjusts for them, and identifies outliers. Second, the deviation of each test sample from the control group in a high-dimensional space is computed, and the test samples are clustered using a new Mapper-based topological algorithm at two levels: a global tier and local tiers. All parameters are either carefully chosen or data-driven, avoiding any user-induced bias. The method is stable, different datasets can be combined for analysis, and significant subgroups can be identified. It outperforms current clustering methods in sensitivity and stability on synthetic and biological datasets, in particular when sample sizes are small; the outcome is not affected by removal of control samples, by choice of normalization, or by subselection of data. TTMap is readily applicable to complex, highly variable biological samples and holds promise for personalized medicine.
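The second step described in the abstract (deviation of each test sample from the control group, followed by Mapper-style clustering) can be illustrated with a toy sketch. This is not the TTMap implementation: the filter (sum of absolute deviations), the overlapping interval cover, the distance threshold, and every function and parameter name below are assumptions chosen for illustration. The control-group adjustment, the outlier detection, and the global/local tier structure are omitted.

```python
import numpy as np


def deviation_from_control(control, test):
    """Per-gene deviation of each test sample from the control mean.

    control, test: arrays of shape (n_samples, n_genes). Using the plain
    mean as the control reference is an assumption of this sketch.
    """
    return test - control.mean(axis=0)


def connected_components(dist, threshold):
    """Label points so that two points share a label when they are linked
    by a chain of pairwise distances <= threshold."""
    n = dist.shape[0]
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and dist[i, j] <= threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


def mapper_clusters(deviations, n_intervals=2, overlap=0.25, threshold=1.0):
    """Toy Mapper-style clustering (hypothetical parameters).

    Filter: sum of absolute deviations per sample.
    Cover: n_intervals overlapping intervals over the filter range.
    Clustering: connected components inside each interval.
    Returns a list of frozensets of sample indices.
    """
    filt = np.abs(deviations).sum(axis=1)
    lo, hi = filt.min(), filt.max()
    width = (hi - lo) / n_intervals
    if width == 0:
        width = 1.0  # all samples share one filter value
    clusters = []
    for k in range(n_intervals):
        a = lo + k * width - overlap * width
        b = lo + (k + 1) * width + overlap * width
        idx = np.where((filt >= a) & (filt <= b))[0]
        if idx.size == 0:
            continue
        sub = deviations[idx]
        # Pairwise Euclidean distances between samples in this interval.
        dist = np.linalg.norm(sub[:, None, :] - sub[None, :, :], axis=2)
        labels = connected_components(dist, threshold)
        for lab in set(labels):
            members = [int(idx[m]) for m, l in enumerate(labels) if l == lab]
            clusters.append(frozenset(members))
    return clusters


# Synthetic data: three control samples and four test samples, two genes each.
control = np.array([[0.0, 0.0], [0.5, -0.5], [-0.5, 0.5]])
test = np.array([[2.0, 0.0], [2.5, 0.0], [0.0, 8.0], [0.0, 7.5]])
clusters = mapper_clusters(deviation_from_control(control, test))
print(clusters)
```

A full Mapper construction would also connect clusters that share samples across overlapping intervals to form a graph; the sketch stops at the clusters themselves to stay short.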

Type: research article
DOI: 10.1093/bioinformatics/btz052
Author(s): Jeitziner, Rachel; Carrière, Mathieu; Rougemont, Jacques; Oudot, Steve; Hess, Kathryn; Brisken, Cathrin; Berger, Bonnie
Date Issued: 2019-02-07
Published in: Bioinformatics
Volume: 35
Issue: 18
Start page: 3339
End page: 3347
Editorial or Peer reviewed: REVIEWED
Written at: EPFL
EPFL units: UPBRI
Available on Infoscience: June 4, 2019
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/156657