Computational identification and experimental characterization of preferred downstream positions in human core promoters
Author summary

Transcription of genes by the RNA polymerase II enzyme initiates at a genomic region termed the core promoter. The core promoter is a regulatory region that may contain diverse short DNA sequence motifs (elements) that confer specific properties on it. Interestingly, core promoter motifs can be located both upstream and downstream of the transcription start site. Various combinations of core promoter elements have been identified. One such combination, the initiator (Inr) motif together with the downstream core promoter element (DPE), has been identified and extensively characterized in fruit flies. Although a few Inr+DPE-containing human promoters have been identified, the presence of transcriptionally important downstream core promoter positions within human promoters has been a matter of controversy in the literature. Here, using a newly designed motif discovery strategy, we discovered preferred downstream positions in human promoters that resemble the fruit fly DPE. Clustering of the corresponding sequence motifs in eight additional species indicated that such promoters could be common to multicellular non-plant organisms. Importantly, functional characterization of the newly discovered preferred downstream positions supports the existence of Inr+DPE-containing promoters in human genes.

Metazoan core promoters, which direct the initiation of transcription by RNA polymerase II (Pol II), may contain short sequence motifs termed core promoter elements or motifs (e.g., the TATA box, the initiator (Inr) and the downstream core promoter element (DPE)), which recruit Pol II via the general transcription machinery. The DPE was discovered and extensively characterized in Drosophila, where its function is strictly dependent on both the presence of an Inr and the precise spacing from it.
Since the Drosophila DPE is recognized by the human transcription machinery, it is likely that some human promoters contain a downstream element that is similar, though not necessarily identical, to the Drosophila DPE. However, only a couple of human promoters have been shown to contain a functional DPE, and attempts to computationally detect human DPE-containing promoters have mostly been unsuccessful. Using a newly designed motif discovery strategy based on Expectation-Maximization probabilistic partitioning algorithms, we discovered preferred downstream positions (PDP) in human promoters that resemble the Drosophila DPE. Available chromatin accessibility footprints revealed that the Drosophila and human Inr+DPE promoter classes are not only highly structured, but also similar to each other, particularly in the proximal downstream region. Clustering of the corresponding sequence motifs using a neighbor-joining algorithm strongly suggests that canonical Inr+DPE promoters could be common to metazoan species. Using reporter assays, we demonstrate the contribution of the identified downstream positions to the function of multiple human promoters. Furthermore, we show that altering the spacing between the Inr and the PDP by two nucleotides reduces promoter activity, suggesting that the newly discovered human PDP depends on its spacing from the Inr. Taken together, our strategy identified novel functional downstream positions within human core promoters, supporting the existence of DPE-like motifs in human promoters.
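The spacing dependency described above can be illustrated with a minimal Python sketch. It is not the motif discovery strategy used in this work. It only checks whether an Inr-like motif and a downstream motif occur at fixed offsets from a given transcription start site. The consensus strings (`TCAKTY` for the Inr, `RGWYV` for the downstream motif), the offsets (-2 and +28) and all function names are illustrative assumptions based on commonly cited Drosophila conventions; they are not the human PDP definitions derived in the paper.

```python
import re

# IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "K": "[GT]", "W": "[AT]",
    "V": "[ACG]", "N": "[ACGT]",
}


def iupac_to_regex(consensus):
    """Compile an IUPAC consensus string into a regular expression."""
    return re.compile("".join(IUPAC[c] for c in consensus))


def position_to_index(tss_index, position):
    """Convert a promoter position (+1 is the TSS, there is no position 0)
    into a 0-based string index."""
    if position == 0:
        raise ValueError("promoter coordinates have no position 0")
    return tss_index + position - 1 if position > 0 else tss_index + position


def motif_at(seq, start, consensus):
    """Return True if the consensus matches seq exactly at index start."""
    segment = seq[start:start + len(consensus)]
    if start < 0 or len(segment) != len(consensus):
        return False
    return iupac_to_regex(consensus).fullmatch(segment) is not None


def has_inr_plus_downstream(seq, tss_index,
                            inr="TCAKTY", inr_position=-2,        # assumed values
                            downstream="RGWYV", downstream_position=28):  # assumed values
    """Check that both motifs sit at their expected offsets from the TSS."""
    seq = seq.upper()
    inr_ok = motif_at(seq, position_to_index(tss_index, inr_position), inr)
    down_ok = motif_at(
        seq, position_to_index(tss_index, downstream_position), downstream
    )
    return inr_ok and down_ok


# Toy promoter: Inr at indices 8-13 (A at index 10 is +1),
# downstream motif starting at +28 (index 37).
promoter = "GCGCGCGC" + "TCAGTC" + "C" * 23 + "AGACG" + "CCCC"
# Same promoter with two nucleotides inserted between the two motifs.
shifted = promoter[:14] + "CC" + promoter[14:]

print(has_inr_plus_downstream(promoter, tss_index=10))
print(has_inr_plus_downstream(shifted, tss_index=10))
```

In this toy example the inserted nucleotides move the downstream motif from +28 to +30, so the shifted sequence no longer satisfies the fixed-spacing check. An exact-match test like this is much cruder than a probabilistic model, but it makes the notion of a spacing-dependent motif pair concrete.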
journal.pcbi.1009256.pdf
publisher
openaccess
CC BY