The biological interpretation of gene lists with interesting shared properties, such as up- or down-regulation in a particular experiment, is typically accomplished using gene ontology enrichment analysis tools. Given a list of genes, a gene ontology (GO) enrichment analysis may return hundreds of statistically significant GO results in a “flat” list, which can be challenging to summarize. It can also be difficult to keep pace with rapidly expanding biological knowledge, which often results in daily changes to any of the over 47,000 gene ontologies that describe biological knowledge. GOATOOLS, a Python-based library, makes it more efficient to stay current with the latest ontologies and annotations, perform gene ontology enrichment analyses to determine over- and under-represented terms, and organize results for greater clarity and easier interpretation using a novel GOATOOLS GO grouping method. We performed functional analyses on both stochastic simulation data and real data from a published RNA-seq study to compare the enrichment results from GOATOOLS to two other popular tools: DAVID and GOstats. GOATOOLS is freely available through GitHub: https://github.com/tanghaibao/goatools.
Orchids make up about 10% of all seed plant species, have great economical value, and are of specific scientific interest because of their renowned flowers and ecological adaptations. Here, we report the first draft genome sequence of a lithophytic orchid, Dendrobium catenatum. We predict 28,910 protein-coding genes, and find evidence of a whole genome duplication shared with Phalaenopsis. We observed the expansion of many resistance-related genes, suggesting a powerful immune system responsible for adaptation to a wide range of ecological niches. We also discovered extensive duplication of genes involved in glucomannan synthase activities, likely related to the synthesis of medicinal polysaccharides. Expansion of MADS-box gene clades ANR1, StMADS11, and MIKC*, involved in the regulation of development and growth, suggests that these expansions are associated with the astonishing diversity of plant architecture in the genus Dendrobium. On the contrary, members of the type I MADS box gene family are missing, which might explain the loss of the endospermous seed. The findings reported here will be important for future studies into polysaccharide synthesis, adaptations to diverse environments and flower architecture of Orchidaceae.
Eucommia ulmoides, also called hardy rubber tree, is an economically important tree; however, the lack of its genome sequence restricts the fundamental biological research and applied studies of this plant species. Here, we present a high-quality assembly of its ∼1.2-Gb genome (scaffold N50 = 1.88 Mb) with at least 26 723 predicted genes for E. ulmoides, the first sequenced genome of the order Garryales, which was obtained using an integrated strategy combining Illumina sequencing, PacBio sequencing, and BioNano mapping. As a sister taxon to lamiids and campanulids, E. ulmoides underwent an ancient genome triplication shared by core eudicots but no further whole-genome duplication in the last ∼125 million years. E. ulmoides exhibits high expression levels and/or gene number expansion for multiple genes involved in stress responses and the biosynthesis of secondary metabolites, which may account for its considerable environmental adaptability. In contrast to the rubber tree (Hevea brasiliensis), which produces cis-polyisoprene, E. ulmoides has evolved to synthesize long-chain trans-polyisoprene via farnesyl diphosphate synthases (FPSs). Moreover, FPS and rubber elongation factor/small rubber particle protein gene families were expanded independently from the H. brasiliensis lineage. These results provide new insights into the biology of E. ulmoides and the origin of polyisoprene biosynthesis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.