Geraniaceae plastid genomes (plastomes) have experienced a remarkable number of genomic changes. The plastomes of Erodium texanum, Geranium palmatum, and Monsonia speciosa were sequenced and compared with other rosids and the previously published Pelargonium hortorum plastome. Geraniaceae plastomes were found to be highly variable in size, gene content and order, repetitive DNA, and codon usage. Several unique plastome rearrangements include the disruption of two highly conserved operons (S10 and rps2-atpA), and the inverted repeat (IR) region in M. speciosa does not contain all genes in the ribosomal RNA operon. The sequence of M. speciosa is unusually small (128,787 bp); among angiosperm plastomes sequenced to date, only those of nonphotosynthetic species and those that have lost one IR copy are smaller. In contrast, the plastome of P. hortorum is the largest, at 217,942 bp. These genomes have experienced numerous gene and intron losses and partial and complete gene duplications. Some of the losses are shared throughout the family (e.g., trnT-GGU and the introns of rps16 and rpl16); however, other losses are homoplasious (e.g., trnG-UCC intron in G. palmatum and M. speciosa). IR length is also highly variable. The IR in P. hortorum was previously shown to be greatly expanded to 76 kb, and the IR is lost in E. texanum and reduced in G. palmatum (11 kb) and M. speciosa (7 kb). Geraniaceae plastomes contain a high frequency of large repeats (>100 bp) relative to other rosids. Within each plastome, repeats are often located at rearrangement end points and many repeats shared among the four Geraniaceae flank rearrangement end points. GC content is elevated in the genomes and also in coding regions relative to other rosids. Codon usage per amino acid and GC content at third position sites are significantly different for Geraniaceae protein-coding sequences relative to other rosids. Our findings suggest that relaxed selection and/or mutational biases lead to increased GC content, and this in turn altered codon usage. We propose that increases in genomic rearrangements, repetitive DNA, nucleotide substitutions, and GC content may be caused by relaxed selection resulting from improper DNA repair.
The plastid genome of Trifolium subterraneum is 144,763 bp, about 20 kb longer than those of closely related legumes, which also lost one copy of the large inverted repeat (IR). The genome has undergone extensive genomic reconfiguration, including the loss of six genes (accD, infA, rpl22, rps16, rps18, and ycf1) and two introns (clpP and rps12) and numerous gene order changes, attributable to 14-18 inversions. All endpoints of rearranged gene clusters are flanked by repeated sequences, tRNAs, or pseudogenes. One unusual feature of the Trifolium subterraneum genome is the large number of dispersed repeats, which comprise 19.5% (ca. 28 kb) of the genome (versus about 4% for other angiosperms) and account for part of the increase in genome size. Nine genes (psbT, rbcL, clpP, rps3, rpl23, atpB, psbN, trnI-cau, and ycf3) have also been duplicated either partially or completely. rpl23 is the most highly duplicated gene, with portions of this gene duplicated six times. Comparisons of the Trifolium plastid genome with the Plant Repeat Database and searches for flanking inverted repeats suggest that the high incidence of dispersed repeats and rearrangements is not likely the result of transposition. Trifolium has 19.5 kb of unique DNA distributed among 160 fragments ranging in size from 30 to 494 bp, greatly surpassing the other five sequenced legume plastid genomes in novel DNA content. At least some of this unique DNA may represent horizontal transfer from bacterial genomes. These unusual features provide direction for the development of more complex models of plastid genome evolution.
Plastid genomes of the grasses (Poaceae) are unusual in their organization and rates of sequence evolution. There has been a recent surge in the availability of grass plastid genome sequences, but a comprehensive comparative analysis of genome evolution has not been performed that includes any related families in the Poales. We report on the plastid genome of Typha latifolia, the first non-grass Poales sequenced to date, and we present comparisons of genome organization and sequence evolution within Poales. Our results confirm that grass plastid genomes exhibit acceleration in both genomic rearrangements and nucleotide substitutions. Poaceae have multiple structural rearrangements, including three inversions, three genes losses (accD, ycf1, ycf2), intron losses in two genes (clpP, rpoC1), and expansion of the inverted repeat (IR) into both large and small single-copy regions. These rearrangements are restricted to the Poaceae, and IR expansion into the small single-copy region correlates with the phylogeny of the family. Comparisons of 73 protein-coding genes for 47 angiosperms including nine Poaceae genera confirm that the branch leading to Poaceae has significantly accelerated rates of change relative to other monocots and angiosperms. Furthermore, rates of sequence evolution within grasses are lower, indicating a deceleration during diversification of the family. Overall there is a strong correlation between accelerated rates of genomic rearrangements and nucleotide substitutions in Poaceae, a phenomenon that has been noted recently throughout angiosperms. The cause of the correlation is unknown, but faulty DNA repair has been suggested in other systems including bacterial and animal mitochondrial genomes.Electronic supplementary materialThe online version of this article (doi:10.1007/s00239-009-9317-3) contains supplementary material, which is available to authorized users.
Angiosperm plastid genomes are generally conserved in gene content and order with rates of nucleotide substitutions for protein-coding genes lower than for nuclear protein-coding genes. A few groups have experienced genomic change, and extreme changes in gene content and order are found within the flowering plant family Geraniaceae. The complete plastid genome sequence of Pelargonium X hortorum (Geraniaceae) reveals the largest and most rearranged plastid genome identified to date. Highly elevated rates of sequence evolution in Geraniaceae mitochondrial genomes have been reported, but rates in Geraniaceae plastid genomes have not been characterized. Analysis of nucleotide substitution rates for 72 plastid genes for 47 angiosperm taxa, including nine Geraniaceae, show that values of dN are highly accelerated in ribosomal protein and RNA polymerase genes throughout the family. Furthermore, dN/dS is significantly elevated in the same two classes of plastid genes as well as in ATPase genes. A relatively high dN/dS ratio could be interpreted as evidence of two phenomena, namely positive or relaxed selection, neither of which is consistent with our current understanding of plastid genome evolution in photosynthetic plants. These analyses are the first to use protein-coding sequences from complete plastid genomes to characterize rates and patterns of sequence evolution for a broad sampling of photosynthetic angiosperms, and they reveal unprecedented accumulation of nucleotide substitutions in Geraniaceae. To explain these remarkable substitution patterns in the highly rearranged Geraniaceae plastid genomes, we propose a model of aberrant DNA repair coupled with altered gene expression.comparative genomics ͉ genome evolution ͉ plastid genome A ngiosperm plastid genomes are generally highly conserved in gene order, gene content, and organization (1). Whereas the rates of nucleotide substitutions are highly variable in protein-coding genes of angiosperm nuclear genomes, rates in plastid genes are generally lower (2). Rates of nonsynonomous substitutions (dN), those that cause an amino acid change, are substantially lower than rates of synonymous substitutions (dS), those that do not cause an amino acid change. Aside from a recent report describing elevated dN for a single gene in Oenothera and lineages within Caryophyllaceae (3), plastid genes of photosynthetic plants are under strong purifying selection and the rapid accumulation of either dN or dS has not been described.The plastid genomes of nonphotosynthetic plants reveal accelerated rates of nucleotide substitutions in many proteincoding genes; furthermore, these genomes exhibit extensive gene loss and genome rearrangement (4-6). However, analyses involving either few genes or few taxa for photosynthetic angiosperm plastid genomes generally reveal that modest rate variation is locus-and lineage-specific. A few groups of angiosperms have experienced lineage-specific rate variation, including the lineages leading to the grasses (7), pea (2), Gnetum (8), and Welwitschia...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.