This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
Plant specialized metabolites have ecological functions, yet the presence of numerous uncharacterized biosynthetic genes in plant genomes suggests that many molecules remain unknown. We discovered a triterpene biosynthetic network in the roots of the small mustard plant Arabidopsis thaliana. Collectively, we have elucidated and reconstituted three divergent pathways for the biosynthesis of root triterpenes, namely thalianin (seven steps), thalianyl medium-chain fatty acid esters (three steps), and arabidin (five steps). A. thaliana mutants disrupted in the biosynthesis of these compounds have altered root microbiota. In vitro bioassays with purified compounds reveal selective growth modulation activities of pathway metabolites toward root microbiota members and their biochemical transformation and utilization by bacteria, supporting a role for this biosynthetic network in shaping an Arabidopsis-specific root microbial community.
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 'transcriptional units', contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense-antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.
Differentiation of secondary metabolite profiles in closely related plant species provide clues for unravelling biosynthetic pathways and regulatory circuits, an area that is still under-investigated. Cucurbitacins, a group of bitter and highly oxygenated tetracyclic triterpenes, are mainly produced by the plant family Cucurbitaceae. These compounds have similar structures, but differ in their anti-tumor activities and eco-physiological roles. By comparative analyses of the genomes of cucumber, melon, and watermelon, we uncovered conserved syntenic loci encoding metabolic genes for distinct cucurbitacins. Characterization of the cytochrome P450s (CYPs) identified from these loci enabled us to unveil a novel multi-oxidation CYP for the tailoring of the cucurbitacin core skeleton as well as two other CYPs responsible for the key structural variations among cucurbitacins C, B and E. We also discovered a syntenic gene cluster of transcription factors that regulate the tissue-specific biosynthesis of cucurbitacins and that may confer the loss of bitterness phenotypes associated with convergent domestication of wild cucurbits. This study illustrates the potential to exploit comparative genomics to identify enzymes and transcription factors that control the biosynthesis of structurally related yet unique natural products.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.