Although the proteins that read the gene regulatory code, transcription factors (TFs), have been largely identified, it is not well known which sequences TFs can recognize. We have analyzed the sequence-specific binding of human TFs using high-throughput SELEX and ChIP sequencing. A total of 830 binding profiles were obtained, describing 239 distinctly different binding specificities. The models represent the majority of human TFs, approximately doubling the coverage compared to existing systematic studies. Our results reveal additional specificity determinants for a large number of factors for which a partial specificity was known, including a commonly observed A- or T-rich stretch that flanks the core motifs. Global analysis of the data revealed that homodimer orientation and spacing preferences, and base-stacking interactions, have a larger role in TF-DNA binding than previously appreciated. We further describe a binding model incorporating these features that is required to understand binding of TFs to DNA.
The majority of CpG dinucleotides in the human genome are methylated at cytosine bases. However, active gene regulatory elements are generally hypomethylated relative to their flanking regions, and the binding of some transcription factors (TFs) is diminished by methylation of their target sequences. By analysis of 542 human TFs with methylation-sensitive SELEX (systematic evolution of ligands by exponential enrichment), we found that there are also many TFs that prefer CpG-methylated sequences. Most of these are in the extended homeodomain family. Structural analysis showed that homeodomain specificity for methylcytosine depends on direct hydrophobic interactions with the methylcytosine 5-methyl group. This study provides a systematic examination of the effect of an epigenetic DNA modification on human TF binding specificity and reveals that many developmentally important proteins display preference for mCpG-containing sequences.
Gene expression is regulated by transcription factors (TFs), proteins that recognize short DNA sequence motifs. Such sequences are very common in the human genome, and an important determinant of the specificity of gene expression is the cooperative binding of multiple TFs to closely located motifs. However, interactions between DNA-bound TFs have not been systematically characterized. To identify TF pairs that bind cooperatively to DNA, and to characterize their spacing and orientation preferences, we have performed consecutive affinity-purification systematic evolution of ligands by exponential enrichment (CAP-SELEX) analysis of 9,400 TF-TF-DNA interactions. This analysis revealed 315 TF-TF interactions recognizing 618 heterodimeric motifs, most of which have not been previously described. The observed cooperativity occurred promiscuously between TFs from diverse structural families. Structural analysis of the TF pairs, including a novel crystal structure of MEIS1 and DLX3 bound to their identified recognition site, revealed that the interactions between the TFs were predominantly mediated by DNA. Most TF pair sites identified involved a large overlap between individual TF recognition motifs, and resulted in recognition of composite sites that were markedly different from the individual TF's motifs. Together, our results indicate that the DNA molecule commonly plays an active role in cooperative interactions that define the gene regulatory lexicon.
Normal differentiation and induced reprogramming require the activation of target cell programs and silencing of donor cell programs. In reprogramming, the same factors are often used to reprogram many different donor cell types. As most developmental repressors, such as RE1-silencing transcription factor (REST) and Groucho (also known as TLE), are considered lineage-specific repressors, it remains unclear how identical combinations of transcription factors can silence so many different donor programs. Distinct lineage repressors would have to be induced in different donor cell types. Here, by studying the reprogramming of mouse fibroblasts to neurons, we found that the pan neuron-specific transcription factor Myt1-like (Myt1l) exerts its pro-neuronal function by direct repression of many different somatic lineage programs except the neuronal program. The repressive function of Myt1l is mediated via recruitment of a complex containing Sin3b by binding to a previously uncharacterized N-terminal domain. In agreement with its repressive function, the genomic binding sites of Myt1l are similar in neurons and fibroblasts and are preferentially in an open chromatin configuration. The Notch signalling pathway is repressed by Myt1l through silencing of several members, including Hes1. Acute knockdown of Myt1l in the developing mouse brain mimicked a Notch gain-of-function phenotype, suggesting that Myt1l allows newborn neurons to escape Notch activation during normal development. Depletion of Myt1l in primary postmitotic neurons de-repressed non-neuronal programs and impaired neuronal gene expression and function, indicating that many somatic lineage programs are actively and persistently repressed by Myt1l to maintain neuronal identity. It is now tempting to speculate that similar 'many-but-one' lineage repressors exist for other cell fates; such repressors, in combination with lineage-specific activators, would be prime candidates for use in reprogramming additional cell types.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.