High-throughput SELEX SAGE method for quantitative modeling of transcription-factor binding sites

Roulet, E.; Busso, S.; Camargo, A. A.; Simpson, A. J.; Mermod, N.; Bucher, P.

doi:10.1038/nbt718

2002

Abstract

The ability to determine the location and relative strength of all transcription-factor binding sites in a genome is important both for a comprehensive understanding of gene regulation and for effective promoter engineering in biotechnological applications. Here we present a bioinformatically driven experimental method to accurately define the DNA-binding sequence specificity of transcription factors. A generalized profile was used as a predictive quantitative model for binding sites, and its parameters were estimated from in vitro-selected ligands using standard hidden Markov model training algorithms. Computer simulations showed that several thousand low- to medium-affinity sequences are required to generate a profile of desired accuracy. To produce data on this scale, we applied high-throughput genomics methods to the biochemical problem addressed here. A method combining systematic evolution of ligands by exponential enrichment (SELEX) and serial analysis of gene expression (SAGE) protocols was coupled to an automated quality-controlled sequence extraction procedure based on Phred quality scores. This allowed the sequencing of a database of more than 10,000 potential DNA ligands for the CTF/NFI transcription factor. The resulting binding-site model defines the sequence specificity of this protein with a high degree of accuracy not achieved earlier and thereby makes it possible to identify previously unknown regulatory sequences in genomic DNA. A covariance analysis of the selected sites revealed non-independent base preferences at different nucleotide positions, providing insight into the binding mechanism.
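As a rough illustration of two ideas mentioned in the abstract, the sketch below filters reads by Phred quality and then builds a simple log-odds weight matrix from aligned sites. It is not the authors' procedure: the paper fits a generalized profile with hidden Markov model training, whereas this example substitutes a plain position weight matrix with pseudocounts. The function names, the threshold of Q20, the uniform background, and the toy sequences are all assumptions made for illustration only.

```python
import math

def phred_error_probability(q):
    """Standard Phred relation: error probability P = 10^(-Q/10)."""
    return 10 ** (-q / 10)

def passes_quality(quals, min_q=20):
    """Keep a read only if every base reaches min_q (hypothetical threshold)."""
    return all(q >= min_q for q in quals)

def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Log-odds weight matrix from equal-length, pre-aligned sites.

    A simplification standing in for the generalized profile of the paper.
    """
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    pwm = []
    for pos in range(length):
        # Start every base at the pseudocount so unseen bases stay finite.
        counts = {base: pseudocount for base in "ACGT"}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pwm.append({base: math.log2((counts[base] / total) / background)
                    for base in "ACGT"})
    return pwm

def score(pwm, seq):
    """Sum of per-position log-odds weights for a candidate site."""
    return sum(column[base] for column, base in zip(pwm, seq))

# Toy, made-up reads with per-base quality values (not data from the paper).
reads = [
    ("TTGGC", [30, 32, 28, 35, 31]),
    ("TTGGA", [25, 27, 30, 22, 29]),
    ("TAGGC", [30, 8, 31, 33, 30]),   # one low-quality base: rejected
    ("TTGGC", [40, 38, 36, 37, 39]),
]
accepted = [seq for seq, quals in reads if passes_quality(quals)]
pwm = build_pwm(accepted)
```

With a matrix like this, `score(pwm, candidate)` ranks candidate sites by how closely they match the selected sequences. A position-independent matrix cannot capture the non-independent base preferences the abstract reports, which is one reason a richer model may be preferable.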

Details

Title High-throughput SELEX SAGE method for quantitative modeling of transcription-factor binding sites

Author(s) Roulet, E.; Busso, S.; Camargo, A. A.; Simpson, A. J.; Mermod, N.; Bucher, P.

Published in Nature Biotechnology

Volume 20

Issue 8

Pages 831-835

Date 2002

Note Laboratory of Molecular Biotechnology, Center for Biotechnology UNIL-EPFL, and Institute of Animal Biology, University of Lausanne, 1015 Lausanne, Switzerland.

DOI https://doi.org/10.1038/nbt718

Laboratories GR-BUCHER

Record creation date 2007-12-17
