BackgroundA major goal of molecular biology is determining the mechanisms that control the transcription of genes. Motif Enrichment Analysis (MEA) seeks to determine which DNA-binding transcription factors control the transcription of a set of genes by detecting enrichment of known binding motifs in the genes' regulatory regions. Typically, the biologist specifies a set of genes believed to be co-regulated and a library of known DNA-binding models for transcription factors, and MEA determines which (if any) of the factors may be direct regulators of the genes. Since the number of factors with known DNA-binding models is rapidly increasing as a result of high-throughput technologies, MEA is becoming increasingly useful. In this paper, we explore ways to make MEA applicable in more settings, and evaluate the efficacy of a number of MEA approaches.ResultsWe first define a mathematical framework for Motif Enrichment Analysis that relaxes the requirement that the biologist input a selected set of genes. Instead, the input consists of all regulatory regions, each labeled with the level of a biological signal. We then define and implement a number of motif enrichment analysis methods. Some of these methods require a user-specified signal threshold, some identify an optimum threshold in a data-driven way and two of our methods are threshold-free. We evaluate these methods, along with two existing methods (Clover and PASTAA), using yeast ChIP-chip data. Our novel threshold-free method based on linear regression performs best in our evaluation, followed by the data-driven PASTAA algorithm. The Clover algorithm performs as well as PASTAA if the user-specified threshold is chosen optimally. Data-driven methods based on three statistical tests–Fisher Exact Test, rank-sum test, and multi-hypergeometric test—perform poorly, even when the threshold is chosen optimally. These methods (and Clover) perform even worse when unrestricted data-driven threshold determination is used.ConclusionsOur novel, threshold-free linear regression method works well on ChIP-chip data. Methods using data-driven threshold determination can perform poorly unless the range of thresholds is limited a priori. The limits implemented in PASTAA, however, appear to be well-chosen. Our novel algorithms—AME (Analysis of Motif Enrichment)—are available at http://bioinformatics.org.au/ame/.
Neural progenitor cells have the ability to give rise to neurons and glia in the embryonic, postnatal and adult brain. During development, the program regulating whether these cells divide and self-renew or exit the cell cycle and differentiate is tightly controlled, and imbalances to the normal trajectory of this process can lead to severe functional consequences. However, our understanding of the molecular regulation of these fundamental events remains limited. Moreover, processes underpinning development of the postnatal neurogenic niches within the cortex remain poorly defined. Here, we demonstrate that Nuclear factor one X (NFIX) is expressed by neural progenitor cells within the embryonic hippocampus, and that progenitor cell differentiation is delayed within Nfix(-/-) mice. Moreover, we reveal that the morphology of the dentate gyrus in postnatal Nfix(-/-) mice is abnormal, with fewer subgranular zone neural progenitor cells being generated in the absence of this transcription factor. Mechanistically, we demonstrate that the progenitor cell maintenance factor Sry-related HMG box 9 (SOX9) is upregulated in the hippocampus of Nfix(-/-) mice and demonstrate that NFIX can repress Sox9 promoter-driven transcription. Collectively, our findings demonstrate that NFIX plays a central role in hippocampal morphogenesis, regulating the formation of neuronal and glial populations within this structure.
Supplementary data are available at Bioinformatics online.
All scripts, motifs and data presented in this article are available online at http://research.imb.uq.edu.au/t.bailey/supplementary_data/McLeay2011a.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.