Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal. Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. We find that few genes are truly ‘housekeeping’, whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles. TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved. Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs. The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses. The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research.
The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations.
The Mouse Genome Database, (MGD, http://www.informatics.jax.org/), integrates genetic, genomic and phenotypic information about the laboratory mouse, a primary animal model for studying human biology and disease. MGD data content includes comprehensive characterization of genes and their functions, standardized descriptions of mouse phenotypes, extensive integration of DNA and protein sequence data, normalized representation of genome and genome variant information including comparative data on mammalian genes. Data within MGD are obtained from diverse sources including manual curation of the biomedical literature, direct contributions from individual investigator's laboratories and major informatics resource centers such as Ensembl, UniProt and NCBI. MGD collaborates with the bioinformatics community on the development of data and semantic standards such as the Gene Ontology (GO) and the Mammalian Phenotype (MP) Ontology. MGD provides a data-mining platform that enables the development of translational research hypotheses based on comparative genotype, phenotype and functional analyses. Both web-based querying and computational access to data are provided. Recent improvements in MGD described here include the association of gene trap data with mouse genes and a new batch query capability for customized data access and retrieval.
The Gene Ontology (GO) project (http://www.geneontology.org/) provides a set of structured, controlled vocabularies for community use in annotating genes, gene products and sequences (also see http://www.sequenceontology.org/). The ontologies have been extended and refined for several biological areas, and improvements to the structure of the ontologies have been implemented. To improve the quantity and quality of gene product annotations available from its public repository, the GO Consortium has launched a focused effort to provide comprehensive and detailed annotation of orthologous genes across a number of ‘reference’ genomes, including human and several key model organisms. Software developments include two releases of the ontology-editing tool OBO-Edit, and improvements to the AmiGO browser interface.
Pax genes encode a family of transcription factors that orchestrate complex processes of lineage determination in the developing embryo. Their key role is to specify and maintain progenitor cells through use of complex molecular mechanisms such as alternate RNA splice forms and gene activation or inhibition in conjunction with protein co-factors. The significance of Pax genes in development is highlighted by abnormalities that arise from the expression of mutant Pax genes. Here, we review the molecular functions of Pax genes during development and detail the regulatory mechanisms by which they specify and maintain progenitor cells across various tissue lineages. We also discuss mechanistic insights into the roles of Pax genes in regeneration and in adult diseases, including cancer. KEY WORDS: Pax genes, Embryogenesis, Lineage determination IntroductionPaired box (Pax) genes encode transcription factors that contain a highly conserved DNA-binding domain called the paired domain (PD, Fig. 1A) and can be considered to be a principle regulator of gene expression. Nine Pax genes (Pax1-Pax9) have been characterised in mammals and the evolutionary conserved paired domain has been identified across phylogenies from insects, to amphibians and birds. In higher vertebrates, PAX proteins are subclassified into groups according to inclusion of an additional DNA-binding homeodomain and/or an octapeptide region, which serves as a binding motif for protein co-factors for potent inhibition of downstream gene transcription (Eberhard et al., 2000) (Fig. 1B); all PAX proteins include a transactivation domain located within the C-terminal amino acids (Underhill, 2012). It is also known that all Pax genes, with the exception of Pax4 and Pax9, produce alternative RNA transcripts (see Table 1). The functional diversity of Pax proteins in vivo is thus linked to the ability to produce alternatively spliced gene products that differ in structure and, consequently, in the binding activity of their paired and homeodomain DNA-binding regions (Underhill, 2012).Three decades ago, the characterisation and roles of Pax genes in embryonic development began to unfold. Early studies discovered that regulatory gene families such as the Pax family are involved in the sequential compartmentalisation and body patterning of developing organisms; thereafter, studies highlighted a role for Pax genes in the early specification of cell fate and the subsequent morphogenesis of various tissues and organs. Following this, mutational studies of Pax genes confirmed the importance of these regulatory roles in the PRIMER School of Medical Sciences, Edith Cowan University, Joondalup, WA 6027, Australia.*Author for correspondence (j.blake@ecu.edu.au) This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.initiation and progression of ...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.