Two-Tier Mapper, an unbiased topology-based clustering method for enhanced global gene expression analysis

Jeitziner, Rachel; Carrière, Mathieu; Rougemont, Jacques; Oudot, Steve; Hess, Kathryn; Brisken, Cathrin; Berger, Bonnie

doi:10.1093/bioinformatics/btz052

Jeitziner, Rachel; Carrière, Mathieu; Rougemont, Jacques; Oudot, Steve; Hess, Kathryn; Brisken, Cathrin; Berger, Bonnie

2019

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Motivation: Unbiased clustering methods are needed to analyze growing numbers of complex data sets. Currently available clustering methods often depend on parameters that are set by the user, they lack stability, and are not applicable to small data sets. To overcome these shortcomings we used topological data analysis, an emerging field of mathematics that can discerns additional feature and discover hidden insights on data sets and has a wide application range. Results: We have developed a topology-based clustering method called Two-Tier Mapper (TTMap) for enhanced analysis of global gene expression datasets. First, TTMap discerns divergent features in the control group, adjusts for them, and identifies outliers. Second, the deviation of each test sample from the control group in a high-dimensional space is computed, and the test samples are clustered using a new Mapper-based topological algorithm at two levels: a global tier and local tiers. All parameters are either carefully chosen or data-driven, avoiding any user-induced bias. The method is stable, different datasets can be combined for analysis, and significant subgroups can be identified. It outperforms current clustering methods in sensitivity and stability on synthetic and biological datasets, in particular when sample sizes are small; outcome is not affected by removal of control samples, by choice of normalization, or by subselection of data. TTMap is readily applicable to complex, highly variable biological samples and holds promise for personalized medicine.

Details

Title Two-Tier Mapper, an unbiased topology-based clustering method for enhanced global gene expression analysis

Author(s) Jeitziner, Rachel ; Carrière, Mathieu ; Rougemont, Jacques ; Oudot, Steve ; Hess, Kathryn ; Brisken, Cathrin ; Berger, Bonnie

Published in Bioinformatics

Volume 35

Issue 18

Pages 3339–3347

Date 2019-02-07

DOI https://doi.org/10.1093/bioinformatics/btz052

Laboratories UPBRI

Record Appears in Scientific production and competences > SV - School of Life Sciences > ISREC - Swiss Institute for Experimental Cancer Research > UPBRI - Prof. Brisken Group
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2019-06-04

Abstract

Details

Actions