We present the results of two exploratory parsimony analyses of DNA sequences from 475 and 499 species of seed plants, respectively, representing all major taxonomic groups. The data are exclusively from the chloroplast gene rbcL, which codes for the large subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO or RuBPCase). We used two different state-transformation assumptions resulting in two sets of cladograms: (i) equal-weighting for the 499-taxon analysis; and (ii) a procedure that differentially weights transversions over transitions within characters and codon positions among characters for the 475-taxon analysis. The degree of congruence between these results and other molecular, as well as morphological, cladistic studies indicates that rbcL sequence variation contains historical evidence appropriate for phylogenetic analysis at this taxonomic level of sampling. Because the topologies presented are necessarily approximate and cannot be evaluated adequately for internal support, these results should be assessed from the perspective of their predictive value and used to direct future studies, both molecular and morphological. In both analyses, the three genera of Gnetales are placed together as the sister group of the flowering plants, and the anomalous aquatic Ceratophyllum (Ceratophyllaceae) is sister to all other flowering plants. Several major lineages identified correspond well with at least some recent taxonomic schemes for angiosperms, particularly those of Dahlgren and Thorne. The basalmost clades within the angiosperms are orders of the apparently polyphyletic subclass Magnoliidae sensu Cronquist. The most conspicuous feature of the topology is that the major division is not monocot versus dicot, but rather one correlated with general pollen type: uniaperturate versus triaperturate. The Dilleniidae and Hamamelidae are the only subclasses that are grossly polyphyletic; an examination of the latter is presented as an example of the use of these broad analyses to focus more restricted studies. A broadly circumscribed Rosidae is paraphyletic to Asteridae and Dilleniidae. Subclass Caryophyllidae is monophyletic and derived from within Rosidae in the 475-taxon analysis but is sister to a group composed of broadly delineated Asteridae and Rosidae in the 499-taxon study.
DNA barcoding involves sequencing a standard region of DNA as a tool for species identification. However, there has been no agreement on which region(s) should be used for barcoding land plants. To provide a community recommendation on a standard plant barcode, we have compared the performance of 7 leading candidate plastid DNA regions (atpF-atpH spacer, matK gene, rbcL gene, rpoB gene, rpoC1 gene, psbK-psbI spacer, and trnH-psbA spacer). Based on assessments of recoverability, sequence quality, and levels of species discrimination, we recommend the 2-locus combination of rbcL؉matK as the plant barcode. This core 2-locus barcode will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants.matK ͉ rbcL ͉ species identification L arge-scale standardized sequencing of the mitochondrial gene CO1 has made DNA barcoding an efficient species identification tool in many animal groups (1). In plants, however, low substitution rates of mitochondrial DNA have led to the search for alternative barcoding regions. From initial investigations of plastid regions (2-4), 7 leading candidates have emerged (5, 6). Four are portions of coding genes (matK, rbcL, rpoB, and rpoC1), and 3 are noncoding spacers (atpF-atpH, trnH-psbA, and psbK-psbI). Different research groups have proposed various combinations of these loci as their preferred plant barcodes, but no consensus has emerged (5-12). This lack of an agreed standard has impeded progress in plant barcoding.Our aim here is to identify a standard DNA barcode for land plants. To achieve this goal, we have pooled data across laboratories including sequence data from 907 samples, representing 445 angiosperm, 38 gymnosperm, and 67 cryptogam species. Using various subsets of these data, we evaluated the 7 candidate loci using criteria in the Consortium for the Barcode of Life's (CBOL) data standards and guidelines for locus selection (http:// www.barcoding.si.edu/protocols.html). Universality: Which loci can be routinely sequenced across the land plants? Sequence quality and coverage: Which loci are most amenable to the production of bidirectional sequences with few or no ambiguous base calls? Discrimination: Which loci enable most species to be distinguished? ResultsUniversality. Direct universality assessments using a single primer pair for each locus in angiosperms resulted in 90%-98% PCR and sequencing success for 6/7 regions. Success for the seventh region, psbK-psbI, was 77% (Fig. 1A). Greater problems were encountered in other land plant groups, with rpoB, matK, atpF-atpH, and psbK-psbI all showing Ͻ50% success in gymnosperms and/or cryptogams based on data compiled from several laboratories (Fig. 1 A).Sequence Quality. Evaluation of sequence quality and coverage from the candidate loci demonstrated that high quality bidirectional sequences were routinely obtained from rbcL, rpoC1, and rpoB (Fig. 1B, x axis). The remaining 4 loci required more manual editing and produced f...
The nucleotide sequence of Korean ginseng (Panax schinseng Nees) chloroplast genome has been completed (AY582139). The circular double-stranded DNA, which consists of 156,318 bp, contains a pair of inverted repeat regions (IRa and IRb) with 26,071 bp each, which are separated by small and large single copy regions of 86,106 bp and 18,070 bp, respectively. The inverted repeat region is further extended into a large single copy region which includes the 5' parts of the rpsl9 gene. Four short inversions associated with short palindromic sequences that form stem-loop structures were also observed in the chloroplast genome of P. schinseng compared to that of Nicotiana tabacum. The genome content and the relative positions of 114 genes (75 peptide-encoding genes, 30 tRNA genes, 4 rRNA genes, and 5 conserved open reading frames [ycfs]), however, are identical with the chloroplast DNA of N. tabacum. Sixteen genes contain one intron while two genes have two introns. Of these introns, only one (trnL-UAA) belongs to the self-splicing group I; all remaining introns have the characteristics of six domains belonging to group II. Eighteen simple sequence repeats have been identified from the chloroplast genome of Korean ginseng. Several of these SSR loci show infra-specific variations. A detailed comparison of 17 known completed chloroplast genomes from the vascular plants allowed the identification of evolutionary modes of coding segments and intron sequences, as well as the evaluation of the phylogenetic utilities of chloroplast genes. Furthermore, through the detailed comparisons of several chloroplast genomes, evolutionary hotspots predominated by the inversion end points, indel mutation events, and high frequencies of base substitutions were identified. Large-sized indels were often associated with direct repeats at the end of the sequences facilitating intra-molecular recombination.
An extensive sequence comparison of the chloroplast ndhF gene from all major clades of the largest flowering plant family (Asteraceae) shows that this gene provides approximately 3 times more phylogenetic information than rbcL. This is because it is substantially longer and evolves twice as fast. The 5' region (1380 bp) of ndhF is very different from the 3' region (855 bp) and is similar to rbcL in both the rate and the pattern of sequence change. The 3' region is more A+T-rich, has higher levels of nonsynonymous base substitution, and shows greater transversion bias at all codon positions. These differences probably reflect different functional constraints on the 5' and 3' regions of ndhF. The two patterns of base substitutions of ndhF are particularly advantageous for phylogenetic reconstruction because the conserved and variable segments can be used for older and recent groups, respectively. Phylogenetic analyses of 94 ndhF sequences provided much better resolution of relationships than previous molecular and morphological phylogenies of the Asteraceae. The ndhF tree identified five major clades: (i) the Calyceraceae is the sister family of Asteraceae; (ii) the Barnadesioideae is monophyletic and is the sister group to the rest of the family; (iii) the Cichorioideae and its two basal tribes Mutisieae and Cardueae are paraphyletic; (iv) four tribes of Cichorioideae (Lactuceae, Arctoteae, Liabeae, and Vernonieae) form a monophyletic group, and these are the sister clade of the Asteroideae; and (v) the Asteroideae is monophyletic and includes three major clades.
The chloroplast (cp) DNA sequence of Jasminum nudiflorum (Oleaceae-Jasmineae) is completed and compared with the large single-copy region sequences from 6 related species. The cp genomes of the tribe Jasmineae (Jasminum and Menodora) show several distinctive rearrangements, including inversions, gene duplications, insertions, inverted repeat expansions, and gene and intron losses. The ycf4-psaI region in Jasminum section Primulina was relocated as a result of 2 overlapping inversions of 21,169 and 18,414 bp. The 1st, larger inversion is shared by all members of the Jasmineae indicating that it occurred in the common ancestor of the tribe. Similar rearrangements were also identified in the cp genome of Menodora. In this case, 2 fragments including ycf4 and rps4-trnS-ycf3 genes were moved by 2 additional inversions of 14 and 59 kb that are unique to Menodora. Other rearrangements in the Oleaceae are confined to certain regions of the Jasminum and Menodora cp genomes, including the presence of highly repeated sequences and duplications of coding and noncoding sequences that are inserted into clpP and between rbcL and psaI. These insertions are correlated with the loss of 2 introns in clpP and a serial loss of segments of accD. The loss of the accD gene and clpP introns in both the monocot family Poaceae and the eudicot family Oleaceae are clearly independent evolutionary events. However, their genome organization is surprisingly similar despite the distant relationship of these 2 angiosperm families.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.