DNA barcoding involves sequencing a standard region of DNA as a tool for species identification. However, there has been no agreement on which region(s) should be used for barcoding land plants. To provide a community recommendation on a standard plant barcode, we have compared the performance of 7 leading candidate plastid DNA regions (atpF-atpH spacer, matK gene, rbcL gene, rpoB gene, rpoC1 gene, psbK-psbI spacer, and trnH-psbA spacer). Based on assessments of recoverability, sequence quality, and levels of species discrimination, we recommend the 2-locus combination of rbcL+matK as the plant barcode. This core 2-locus barcode will provide a universal framework for the routine use of DNA sequence data to identify specimens and contribute toward the discovery of overlooked species of land plants.
matK | rbcL | species identification
Large-scale standardized sequencing of the mitochondrial gene CO1 has made DNA barcoding an efficient species identification tool in many animal groups (1). In plants, however, low substitution rates of mitochondrial DNA have led to the search for alternative barcoding regions. From initial investigations of plastid regions (2-4), 7 leading candidates have emerged (5, 6). Four are portions of coding genes (matK, rbcL, rpoB, and rpoC1), and 3 are noncoding spacers (atpF-atpH, trnH-psbA, and psbK-psbI). Different research groups have proposed various combinations of these loci as their preferred plant barcodes, but no consensus has emerged (5-12). This lack of an agreed standard has impeded progress in plant barcoding.
Our aim here is to identify a standard DNA barcode for land plants. To achieve this goal, we have pooled data across laboratories including sequence data from 907 samples, representing 445 angiosperm, 38 gymnosperm, and 67 cryptogam species.
Using various subsets of these data, we evaluated the 7 candidate loci using criteria in the Consortium for the Barcode of Life's (CBOL) data standards and guidelines for locus selection (http://www.barcoding.si.edu/protocols.html). Universality: Which loci can be routinely sequenced across the land plants? Sequence quality and coverage: Which loci are most amenable to the production of bidirectional sequences with few or no ambiguous base calls? Discrimination: Which loci enable most species to be distinguished?
Results
Universality. Direct universality assessments using a single primer pair for each locus in angiosperms resulted in 90%-98% PCR and sequencing success for 6/7 regions. Success for the seventh region, psbK-psbI, was 77% (Fig. 1A). Greater problems were encountered in other land plant groups, with rpoB, matK, atpF-atpH, and psbK-psbI all showing <50% success in gymnosperms and/or cryptogams based on data compiled from several laboratories (Fig. 1A).
Sequence Quality. Evaluation of sequence quality and coverage from the candidate loci demonstrated that high quality bidirectional sequences were routinely obtained from rbcL, rpoC1, and rpoB (Fig. 1B, x axis). The remaining 4 loci required more manual editing and produced f...
A universal barcode system for land plants would be a valuable resource, with potential utility in fields as diverse as ecology, floristics, law enforcement and industry. However, the application of plant barcoding has been constrained by a lack of consensus regarding the most variable and technically practical DNA region(s). We compared eight candidate plant barcoding regions from the plastome and one from the mitochondrial genome for how well they discriminated the monophyly of 92 species in 32 diverse genera of land plants (N = 251 samples). The plastid markers comprise portions of five coding (rpoB, rpoC1, rbcL, matK and 23S rDNA) and three non-coding (trnH-psbA, atpF–atpH, and psbK–psbI) loci. Our survey included several taxonomically complex groups, and in all cases we examined multiple populations and species. The regions differed in their ability to discriminate species, and in ease of retrieval, in terms of amplification and sequencing success. Single locus resolution ranged from 7% (23S rDNA) to 59% (trnH-psbA) of species with well-supported monophyly. Sequence recovery rates were related primarily to amplification success (85–100% for plastid loci), with matK requiring the greatest effort to achieve reasonable recovery (88% using 10 primer pairs). Several loci (matK, psbK–psbI, trnH-psbA) were problematic for generating fully bidirectional sequences. Setting aside technical issues related to amplification and sequencing, combining the more variable plastid markers provided clear benefits for resolving species, although with diminishing returns, as all combinations assessed using four to seven regions had only marginally different success rates (69–71%; values that were approached by several two- and three-region combinations). This performance plateau may indicate fundamental upper limits on the precision of species discrimination that is possible with DNA barcoding systems that include moderate numbers of plastid markers. 
Resolution to the contentious debate on plant barcoding should therefore involve increased attention to practical issues related to the ease of sequence recovery, global alignability, and marker redundancy in multilocus plant DNA barcoding systems.
The ability to discriminate between species using barcoding loci has proved more difficult in plants than animals, raising the possibility that plant species boundaries are less well defined. Here, we review a selection of published barcoding data sets to compare species discrimination in plants vs. animals. Although the use of different genetic markers, analytical methods and depths of taxon sampling may complicate comparisons, our results using common metrics demonstrate that the number of species supported as monophyletic using barcoding markers is higher in animals (> 90%) than plants (~70%), even after controlling for the amount of parsimony-informative information per species. This suggests that more than a simple lack of variability limits species discrimination in plants. Both animal and plant species pairs have variable size gaps between intra- and interspecific genetic distances, but animal species tend to have larger gaps than plants, even in relatively densely sampled genera. An analysis of 12 plant genera suggests that hybridization contributes significantly to variation in genetic discontinuity in plants. Barcoding success may be improved in some plant groups by careful choice of markers and appropriate sampling; however, overall fine-scale species discrimination in plants relative to animals may be inherently more difficult because of greater levels of gene-tree paraphyly.
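The gap between intra- and interspecific genetic distances mentioned above can be illustrated with a small calculation. The following is a minimal sketch only, not the method used in the reviewed data sets: it assumes sequences are already aligned to equal length, uses an uncorrected p-distance (an assumption made here for simplicity), and the names `p_distance`, `barcode_gaps`, and the toy species labels are hypothetical.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Assumption: uncorrected p-distance; sites where either sequence
    has a gap or an ambiguous base are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper())
             if x in "ACGT" and y in "ACGT"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def barcode_gaps(samples):
    """Return {species: min interspecific - max intraspecific distance}.

    samples is a list of (species_label, aligned_sequence) tuples.
    A positive value means the species is separated from all others
    by more than its own internal variation; zero or negative means
    no barcode gap for that species.
    """
    intra_max = {}
    inter_min = {}
    for (sp_a, seq_a), (sp_b, seq_b) in combinations(samples, 2):
        d = p_distance(seq_a, seq_b)
        if sp_a == sp_b:
            intra_max[sp_a] = max(intra_max.get(sp_a, 0.0), d)
        else:
            for sp in (sp_a, sp_b):
                inter_min[sp] = min(inter_min.get(sp, float("inf")), d)
    # Species with a single sample get an intraspecific distance of 0
    return {sp: inter_min[sp] - intra_max.get(sp, 0.0) for sp in inter_min}


# Hypothetical toy alignment: sp_A and sp_C share a haplotype, sp_B is distinct
toy_samples = [
    ("sp_A", "ACGTACGTAC"),
    ("sp_A", "ACGTACGTAT"),
    ("sp_B", "ACGTTCGTGG"),
    ("sp_B", "ACGTTCGTGG"),
    ("sp_C", "ACGTACGTAT"),
]
toy_gaps = barcode_gaps(toy_samples)
```

In this toy data, `sp_B` has a positive gap while `sp_A` and `sp_C` do not, because they share a haplotype. A distance-based gap is a simpler criterion than the monophyly-based measure the review reports, so the two can disagree for the same data.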
Pollen DNA metabarcoding (marker-based genetic identification of potentially mixed-species pollen samples) has applications across a variety of fields. While basic species-level pollen identification using standard DNA barcode markers is established, the extent to which metabarcoding (a) correctly assigns species identities to mixes (qualitative matching) and (b) generates sequence reads proportionally to their relative abundance in a sample (quantitative matching) is unclear, as these have not been assessed relative to known standards. We tested the quantitative and qualitative robustness of metabarcoding in constructed pollen mixtures varying in species richness (1-9 species), taxonomic relatedness (within genera to across class) and rarity (5%-100% of grains), using Illumina MiSeq with the markers rbcL and ITS2. Qualitatively, species composition determinations were largely correct, but false positives and negatives occurred. False negatives were typically driven by lack of a barcode gap or rarity in a sample. Species richness and taxonomic relatedness, however, did not strongly impact correct determinations. False positives were likely driven by contamination, chimeric sequences and/or misidentification by the bioinformatics pipeline. Quantitatively, the proportion of reads for each species was only weakly correlated with its relative abundance, in contrast to suggestions from some other studies. Quantitative mismatches are not correctable by consistent scaling factors, but instead are context-dependent on the other species present in a sample. Together, our results show that metabarcoding is largely robust for determining pollen presence/absence but that sequence reads should not be used to infer relative abundance of pollen grains.
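The qualitative and quantitative comparisons described above can be sketched in a few lines. This is an illustrative sketch only, not the authors' bioinformatics pipeline: it assumes each read has already been assigned a taxon label, that the true grain proportions of the constructed mixture are known, and it uses a Pearson correlation as one possible measure of quantitative matching. The function names and the toy mixture are hypothetical.

```python
from collections import Counter
from math import sqrt


def read_proportions(assignments):
    """Convert a list of per-read taxon labels into read proportions."""
    counts = Counter(assignments)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads supplied")
    return {taxon: n / total for taxon, n in counts.items()}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / sqrt(sxx * syy)


def compare_to_known(known, assignments):
    """Compare read assignments against a known pollen mixture.

    known maps taxon -> true proportion of grains in the mixture.
    Returns (false_positives, false_negatives, correlation).
    """
    observed = read_proportions(assignments)
    taxa = sorted(set(known) | set(observed))
    # Qualitative matching: presence/absence disagreements
    false_pos = [t for t in taxa
                 if known.get(t, 0.0) == 0 and observed.get(t, 0.0) > 0]
    false_neg = [t for t in taxa
                 if known.get(t, 0.0) > 0 and observed.get(t, 0.0) == 0]
    # Quantitative matching: do read proportions track grain proportions?
    r = pearson([known.get(t, 0.0) for t in taxa],
                [observed.get(t, 0.0) for t in taxa])
    return false_pos, false_neg, r


# Hypothetical mixture: sp_C is rare and goes undetected, sp_D is a contaminant
toy_known = {"sp_A": 0.50, "sp_B": 0.45, "sp_C": 0.05}
toy_reads = ["sp_A"] * 20 + ["sp_B"] * 70 + ["sp_D"] * 10
toy_fp, toy_fn, toy_r = compare_to_known(toy_known, toy_reads)
```

In the toy mixture the rare taxon is missed (a false negative) and an unexpected taxon appears (a false positive). A single correlation per sample cannot show the context dependence the abstract describes, which would require comparing the same species across mixtures of different composition.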
Identification of the species origin of pollen has many applications, including assessment of plant-pollinator networks, reconstruction of ancient plant communities, product authentication, allergen monitoring, and forensics. Such applications, however, have previously been limited by microscopy-based identification of pollen, which is slow, has low taxonomic resolution, and has few expert practitioners. One alternative is pollen DNA barcoding, which could overcome these issues. Recent studies demonstrate that both chloroplast and nuclear barcoding markers can be amplified from pollen. These recent validations of pollen metabarcoding indicate that now is the time for researchers in various fields to consider applying these methods to their research programs. In this paper, we review the nascent field of pollen DNA barcoding and discuss potential new applications of this technology, highlighting existing limitations and future research developments that will improve its utility in a wide range of applications.
Key words: DNA metabarcoding, metagenomics, pollen, palynology, high-throughput sequencing, next-generation sequencing.