We present a de novo assembly of a eukaryote transcriptome using 454 pyrosequencing data. The Glanville fritillary butterfly (Melitaea cinxia; Lepidoptera: Nymphalidae) is a prominent species in population biology but had no previous genomic data. Sequencing runs using two normalized complementary DNA collections from a genetically diverse pool of larvae, pupae, and adults yielded 608,053 expressed sequence tags (mean length = 110 nucleotides), which assembled into 48,354 contigs (sets of overlapping DNA segments) and 59,943 singletons. BLAST comparisons confirmed the accuracy of the sequencing and assembly, and indicated the presence of c. 9000 unique genes, along with > 6000 additional microarray-confirmed unannotated contigs. Average depth of coverage was 6.5-fold for the longest 4800 contigs (348-2849 bp in length), sufficient for detecting large numbers of single nucleotide polymorphisms. Oligonucleotide microarray probes designed from the assembled sequences showed highly repeatable hybridization intensity and revealed biological differences among individuals. We conclude that 454 sequencing, when performed to provide sufficient coverage depth, allows de novo transcriptome assembly and a fast, cost-effective, and reliable method for development of functional genomic tools for nonmodel species. This development narrows the gap between approaches based on model organisms with rich genetic resources vs. species that are most tractable for ecological and evolutionary studies.
Evolution may depend more strongly on variation in gene expression than on differences between variant forms of proteins. Regions of DNA that affect gene expression are highly variable, containing 0.6% polymorphic sites. These naturally occurring polymorphic nucleotides can alter in vivo transcription rates. Thus, one might expect substantial variation in gene expression between individuals. But the natural variation in mRNA expression for a large number of genes has not been measured. Here we report microarray studies addressing the variation in gene expression within and between natural populations of teleost fish of the genus Fundulus. We observed statistically significant differences in expression between individuals within the same population for approximately 18% of 907 genes. Expression typically differed by a factor of 1.5, and often more than 2.0. Differences between populations increased the variation. Much of the variation between populations was a positive function of the variation within populations and thus is most parsimoniously described as random. Some genes showed unexpected patterns of expression--changes unrelated to evolutionary distance. These data suggest that substantial natural variation exists in gene expression and that this quantitative variation is important in evolution.
Atlantic killifish populations have rapidly adapted to normally lethal levels of pollution in four urban estuaries. Through analysis of 384 whole killifish genome sequences and comparative transcriptomics in four pairs of sensitive and tolerant populations, we identify the aryl hydrocarbon receptor-based signaling pathway as a shared target of selection. This suggests evolutionary constraint on adaptive solutions to complex toxicant mixtures at each site. However, distinct molecular variants apparently contribute to adaptive pathway modification among tolerant populations. Selection also targets other toxicity-mediating genes, and genes of connected signaling pathways, indicating complex tolerance phenotypes and potentially compensatory adaptations. Molecular changes are consistent with selection on standing genetic variation. In killifish high nucleotide diversity has likely been a crucial substrate for selective sweeps to propel rapid adaptation.
Variation among populations in gene expression should be related to the accumulation of random-neutral changes and evolution by natural selection. The following evolutionary analysis has general applicability to biological and medical science because it accounts for genetic relatedness and identifies patterns of expression variation that are affected by natural selection. To identify genes evolving by natural selection, we allocate the maximum amongpopulation variation to genetic distance and then examine the remaining variation relative to a hypothesized important ecological parameter (temperature). These analyses measure the expression of metabolic genes in common-gardened populations of the fish Fundulus heteroclitus whose habitat is distributed along a steep thermal gradient. Although much of the variation in gene expression fits a null model of neutral drift, the variation in expression for 22% of the genes that regress with habitat temperature was far greater than could be accounted for by genetic distance alone. The most parsimonious explanation for amongpopulation variation for these genes is evolution by natural selection. In addition, many metabolic genes have patterns of variation incongruent with neutral evolution: They have too much or too little variation. These patterns of biological variation in expression may reflect important physiological or ecological functions.evolutionary analysis ͉ Fundulus ͉ microarray ͉ phylogenetic comparative approach ͉ genomics G ene expression has been hypothesized to be of adaptive importance (1), and heritable variation that affects fitness is necessary for evolution by natural selection. Although adaptive differences in expression have been identified in single-gene studies (2-6; see ref. 7 for review), microarray approaches offer great promise to rigorously address this hypothesis because they assay many loci at once. Furthermore, it is generally agreed that much of variation in gene expression for a particular environmental condition has a genetic basis (8, 9) according to studies in yeast (10-13), Drosophila (14-18), mice (19), and humans (20)(21)(22).Widespread heritable variation combined with extensive natural variation in gene expression revealed by microarray studies (11,13,14,(23)(24)(25)(26) provides the substrates for evolution. However, two evolutionary forces govern the variance of traits among taxa: neutral drift and natural selection. Under a neutral drift model, the variation in a trait has little biological effect § and is a function of genetic distance: Traits will be more similar among closely related taxa than among more distantly related taxa (27)(28)(29). If natural selection has occurred, the variation in a trait affects an organism's fitness and is a function of the ecological setting: Traits are conserved or diverge depending on the specific ecological pressures (30). Recent studies suggest that much of the extensive variation in gene expression among individuals and taxa is simply random neutral divergence (31, 32), whereas others have fo...
Heritable variation in regulatory or coding regions is the raw material for evolutionary processes. The advent of microarrays has recently promoted examination of the extent of variation in gene expression within and among taxa and examination of the evolutionary processes affecting variation. This review examines these issues. We find: (i) microarray-based measures of gene expression are precise given appropriate experimental design; (ii) there is large interindividual variation, which is composed of a minor nongenetic component and a large heritable component; (iii) variation among populations and species appears to be affected primarily by neutral drift and stabilizing selection, and to a lesser degree by directional selection; and (iv) neutral evolutionary divergence in gene expression becomes nonlinear with greater divergence times due to functional constraint. Evolutionary analyses of gene expression reviewed here provide unique insights into partitioning of regulatory variation in nature. However, common limitations of these studies include the tendency to assume a linear relationship between expression divergence and species divergence, and failure to test explicit hypotheses that involve the ecological context of evolutionary divergence.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.