Evaluation of methods for modeling transcription factor sequence specificity

Weirauch, Matthew T.; Cote, Atina; Norel, Raquel; Annala, Matti; Zhao, Yue; Riley, Todd R.; Saez-Rodriguez, Julio; Cokelaer, Thomas; Vedenko, Anastasia; Talukder, Shaheynoor; Bussemaker, Harmen J.; Morris, Quaid D.; Bulyk, Martha L.; Stolovitzky, Gustavo; Hughes, Timothy R.; DREAM5 Consortium

doi:10.1038/nbt.2486

2013

Abstract

Genomic analyses often involve scanning for potential transcription factor (TF) binding sites using models of the sequence specificity of DNA-binding proteins. Many approaches have been developed to model and learn a protein's DNA-binding specificity, but these methods have not been systematically compared. Here we applied 26 such approaches to in vitro protein-binding microarray data for 66 mouse TFs belonging to various families. For nine TFs, we also scored the resulting motif models on in vivo data and found that the best in vitro-derived motifs performed similarly to motifs derived from the in vivo data. Our results indicate that simple models based on mononucleotide position weight matrices trained by the best methods perform similarly to more complex models for most TFs examined, but fall short in specific cases (<10% of the TFs examined here). In addition, the best-performing motifs typically have relatively low information content, consistent with widespread degeneracy in eukaryotic TF sequence preferences.
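
For readers unfamiliar with the terms in the abstract, the following is a minimal sketch of what a mononucleotide position weight matrix (PWM) scan and an information-content calculation can look like. It is an illustration only, not any of the 26 approaches evaluated in the article. The toy counts, the pseudocount of 1, the uniform background of 0.25 and the function names (`pwm_from_counts`, `information_content`, `scan`) are all assumptions made for this example.

```python
import math

BASES = "ACGT"

def _probabilities(column, pseudocount):
    """Smoothed base probabilities for one motif position."""
    total = sum(column[b] for b in BASES) + 4 * pseudocount
    return {b: (column[b] + pseudocount) / total for b in BASES}

def pwm_from_counts(counts, pseudocount=1.0, background=0.25):
    """Turn per-position base counts into log2-odds weights (assumed uniform background)."""
    return [
        {b: math.log2(p / background) for b, p in _probabilities(col, pseudocount).items()}
        for col in counts
    ]

def information_content(counts, pseudocount=1.0, background=0.25):
    """Total information content in bits, summed over motif positions."""
    bits = 0.0
    for col in counts:
        for p in _probabilities(col, pseudocount).values():
            bits += p * math.log2(p / background)
    return bits

def scan(sequence, pwm):
    """Return (best_score, offset) over all windows; sequence must contain only A, C, G, T."""
    width = len(pwm)
    best = None
    for i in range(len(sequence) - width + 1):
        score = sum(pwm[j][sequence[i + j]] for j in range(width))
        if best is None or score > best[0]:
            best = (score, i)
    return best

# Hypothetical toy motif with consensus TGAC (not data from the article).
toy_counts = [
    {"A": 1, "C": 1, "G": 1, "T": 17},
    {"A": 1, "C": 1, "G": 17, "T": 1},
    {"A": 17, "C": 1, "G": 1, "T": 1},
    {"A": 1, "C": 17, "G": 1, "T": 1},
]
toy_pwm = pwm_from_counts(toy_counts)
best_score, best_offset = scan("AATGACGG", toy_pwm)
total_bits = information_content(toy_counts)
```

In this sketch each position contributes independently to the score, which is what "mononucleotide" refers to. A fully degenerate position would contribute close to 0 bits and a fully specific one close to 2 bits, so a motif with low information content tolerates many sequence variants.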

Details

Title Evaluation of methods for modeling transcription factor sequence specificity

Author(s) Weirauch, Matthew T. ; Cote, Atina ; Norel, Raquel ; Annala, Matti ; Zhao, Yue ; Riley, Todd R. ; Saez-Rodriguez, Julio ; Cokelaer, Thomas ; Vedenko, Anastasia ; Talukder, Shaheynoor ; Bussemaker, Harmen J. ; Morris, Quaid D. ; Bulyk, Martha L. ; Stolovitzky, Gustavo ; Hughes, Timothy R. ; DREAM5 Consortium

Published in Nature Biotechnology

Volume 31

Issue 2

Pages 126-134

Date 2013

Publisher Nature Publishing Group

ISSN 1087-0156

DOI https://doi.org/10.1038/nbt.2486

Laboratories GR-BUCHER

Record creation date 2014-02-06
