Background: The duplication of genes can occur through various mechanisms and is thought to make a major contribution to the evolutionary diversification of organisms. There is increasing evidence for a large-scale duplication of genes in some chelicerate lineages including two rounds of whole genome duplication (WGD) in horseshoe crabs. To investigate this further, we sequenced and analyzed the genome of the common house spider Parasteatoda tepidariorum. Results: We found pervasive duplication of both coding and non-coding genes in this spider, including two clusters of Hox genes. Analysis of synteny conservation across the P. tepidariorum genome suggests that there has been an ancient WGD in spiders. Comparison with the genomes of other chelicerates, including that of the newly sequenced bark scorpion Centruroides sculpturatus, suggests that this event occurred in the common ancestor of spiders and scorpions, and is probably independent of the WGDs in horseshoe crabs. Furthermore, characterization of the sequence and expression of the Hox paralogs in P. tepidariorum suggests that many have been subject to neo-functionalization and/or sub-functionalization since their duplication. Conclusions: Our results reveal that spiders and scorpions are likely the descendants of a polyploid ancestor that lived more than 450 MYA. Given the extensive morphological diversity and ecological adaptations found among these animals, rivaling those of vertebrates, our study of the ancient WGD event in Arachnopulmonata provides a new comparative platform to explore common and divergent evolutionary outcomes of polyploidization events across eukaryotes. Electronic supplementary material: The online version of this article (doi:10.1186/s12915-017-0399-x) contains supplementary material, which is available to authorized users.
Chelicerata represents one of the oldest groups of arthropods, with a fossil record extending to the Cambrian, and is sister group to the remaining extant arthropods, the mandibulates. Attempts to resolve the internal phylogeny of chelicerates have achieved little consensus, due to marked discord in both morphological and molecular hypotheses of chelicerate phylogeny. The monophyly of Arachnida, the terrestrial chelicerates, is generally accepted, but has garnered little support from molecular data, which have been limited either in breadth of taxonomic sampling or in depth of sequencing. To address the internal phylogeny of this group, we employed a phylogenomic approach, generating transcriptomic data for 17 species in combination with existing data, including two complete genomes. We analyzed multiple data sets containing up to 1,235,912 sites across 3,644 loci, using alternative approaches to optimization of matrix composition. Here, we show that phylogenetic signal for the monophyly of Arachnida is restricted to the 500 slowest-evolving genes in the data set. Accelerated evolutionary rates in Acariformes, Pseudoscorpiones, and Parasitiformes potentially engender long-branch attraction artifacts, yielding nonmonophyly of Arachnida with increasing support upon incrementing the number of concatenated genes. Mutually exclusive hypotheses are supported by locus groups of variable evolutionary rate, revealing significant conflicts in phylogenetic signal. Analyses of gene-tree discordance indicate marked incongruence in relationships among chelicerate orders, whereas derived relationships are demonstrably robust. Consistently recovered and supported relationships include the monophyly of Chelicerata, Euchelicerata, Tetrapulmonata, and all orders represented by multiple terminals. 
Relationships supported by subsets of slow-evolving genes include Ricinulei + Solifugae; a clade comprised of Ricinulei, Opiliones, and Solifugae; and a clade comprised of Tetrapulmonata, Scorpiones, and Pseudoscorpiones. We demonstrate that outgroup selection without regard for branch length distribution exacerbates long-branch attraction artifacts and does not mitigate gene-tree discordance, regardless of high gene representation for outgroups that are model organisms. Arachnopulmonata (new name) is proposed for the clade comprising Scorpiones + Tetrapulmonata (previously named Pulmonata).
Abstract. To re-evaluate the relationships of the major bivalve lineages, we amassed detailed morpho-anatomical, ultrastructural and molecular sequence data for a targeted selection of exemplar bivalves spanning the phylogenetic diversity of the class. We included molecular data for 103 bivalve species (up to five markers) and also analysed a subset of taxa with four additional nuclear protein-encoding genes. Novel as well as historically employed morphological characters were explored, and we systematically disassembled widely used descriptors such as gill and stomach 'types'. Phylogenetic analyses, conducted using parsimony direct optimisation and probabilistic methods on static alignments (maximum likelihood and Bayesian inference) of the molecular data, both alone and in combination with morphological characters, offer a robust test of bivalve relationships. A calibrated phylogeny also provided insights into the tempo of bivalve evolution. Finally, an analysis of the informativeness of morphological characters showed that sperm ultrastructure characters are among the best morphological features to diagnose bivalve clades, followed by characters of the shell, including its microstructure. Our study found support for monophyly of most broadly recognised higher bivalve taxa, although support was not uniform for Protobranchia. However, monophyly of the bivalves with protobranchiate gills was the best-supported hypothesis with incremental morphological and/or molecular sequence data. Autobranchia,
The tuatara (Sphenodon punctatus), the only living member of the reptilian order Rhynchocephalia (Sphenodontia), once widespread across Gondwana [1,2], is an iconic species that is endemic to New Zealand [2,3]. A key link to the now-extinct stem reptiles (from which dinosaurs, modern reptiles, birds and mammals evolved), the tuatara provides key insights into the ancestral amniotes [2,4]. Here we analyse the genome of the tuatara, which, at approximately 5 Gb, is among the largest of the vertebrate genomes yet assembled. Our analyses of this genome, along with comparisons with other vertebrate genomes, reinforce the uniqueness of the tuatara. Phylogenetic analyses indicate that the tuatara lineage diverged from that of snakes and lizards around 250 million years ago. This lineage also shows moderate rates of molecular evolution, with instances of punctuated evolution. Our genome sequence analysis identifies expansions of proteins, non-protein-coding RNA families and repeat elements, the latter of which show an amalgam of reptilian and mammalian features. The sequencing of the tuatara genome provides a valuable resource for deep comparative analyses of tetrapods, as well as for tuatara biology and conservation. Our study also provides important insights into both the technical challenges and the cultural obligations that are associated with genome sequencing.
Introduction: Traditionally, genomic or transcriptomic data have been restricted to a few model or emerging model organisms, and to a handful of species of medical and/or environmental importance. Next-generation sequencing techniques have the capability of yielding massive amounts of gene sequence data for virtually any species at a modest cost. Here we provide a comparative analysis of de novo assembled transcriptomic data for ten non-model species of previously understudied animal taxa. Results: cDNA libraries of ten species belonging to five animal phyla (2 Annelida [including Sipuncula], 2 Arthropoda, 2 Mollusca, 2 Nemertea, and 2 Porifera) were sequenced in different batches with an Illumina Genome Analyzer II (read length 100 or 150 bp), rendering between ca. 25 and 52 million reads per species. Read thinning, trimming, and de novo assembly were performed under different parameters to optimize output. Between 67,423 and 207,559 contigs were obtained across the ten species, post-optimization. Of those, 9,069 to 25,681 contigs retrieved blast hits against the NCBI non-redundant database, and approximately 50% of these were assigned with Gene Ontology terms, covering all major categories, and with similar percentages in all species. Local blasts against our datasets, using selected genes from major signaling pathways and housekeeping genes, revealed high efficiency in gene recovery compared to available genomes of closely related species.
Intriguingly, our transcriptomic datasets detected multiple paralogues in all phyla and in nearly all gene pathways, including housekeeping genes that are traditionally used in phylogenetic applications for their purported single-copy nature. Conclusions: We generated the first study of comparative transcriptomics across multiple animal phyla (comparing two species per phylum in most cases), established the first Illumina-based transcriptomic datasets for sponge, nemertean, and sipunculan species, and generated a tractable catalogue of annotated genes (or gene fragments) and protein families for ten newly sequenced non-model organisms, some of commercial importance (i.e., Octopus vulgaris). These comprehensive sets of genes can be readily used for phylogenetic analysis, gene expression profiling, developmental analysis, and can also be a powerful resource for gene discovery. The characterization of the transcriptomes of such a diverse array of animal species permitted the comparison of sequencing depth, functional annotation, and efficiency of genomic sampling using the same pipelines, which proved to be similar for all considered species. In addition, the datasets revealed their potential as a resource for paralogue detection, a recurrent concern in various aspects of biological inquiry, including phylogenetics, molecular evolution, development, and cellular biochemistry.