Uncovering Effects from the Structure of Metabarcode Sequences for Metagenetic and Microbiome Analysis

Molik, David; Pfrender, Michael E.; Emrich, Scott J.

doi:10.3390/mps3010022

Cited by 4 publications

(4 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…These biological entities next can be compared with OTUs or ASVs in different studies, such as the BIN framework introduced by BOLD, to estimate the biodiversity of target samples. Yet biological interpretation of metabarcoding data can be seriously affected by the differences between the two methods: OTUs minimize the effects of slight variations in sequences that may or may not be of interest, but a small change, as in the case of parasitoid wasps, could be capturing actual differences between species; on the contrary, ASVs are defined as all “unique reads” within a metabarcoded dataset, often leading to a wrong differentiation between the SNPs of the same species, and in the same way making sequencing or PCR errors more prominent when compared to OTUs (Molik et al 2020 ). By using simulations, it has been advised that approaches utilizing ASVs outperform OTUs only when the sequencing depth is sufficient to cover a biological complexity with low polymorphisms.…”

Section: Development Of Dna Barcodingmentioning

confidence: 99%

Life barcoded by DNA barcodes

Guo

Yuan

Tao

et al. 2022

Conservation Genet Resour

View full text Add to dashboard Cite

The modern concept of DNA-based barcoding for cataloguing biodiversity was proposed in 2003 by first adopting an approximately 600 bp fragment of the mitochondrial COI gene to compare via nucleotide alignments with known sequences from specimens previously identified by taxonomists. Other standardized regions meeting barcoding criteria then are also evolving as DNA barcodes for fast, reliable and inexpensive assessment of species composition across all forms of life, including animals, plants, fungi, bacteria and other microorganisms. Consequently, global DNA barcoding campaigns have resulted in the formation of many online workbenches and databases, such as BOLD system, as barcode references, and facilitated the development of mini-barcodes and metabarcoding strategies as important extensions of barcode techniques. Here we intend to give an overview of the characteristics and features of these barcode markers and major reference libraries existing for barcoding the planet’s life, as well as to address the limitations and opportunities of DNA barcodes to an increasingly broader community of science and society.

show abstract

Section: Development Of Dna Barcodingmentioning

confidence: 99%

Life barcoded by DNA barcodes

Guo

Yuan

Tao

et al. 2022

Conservation Genet Resour

View full text Add to dashboard Cite

show abstract

“…ASVs are commonly generated using the Divisive Amplicon Denoising Algorithm 2 (DADA2), and the resultant ASVs represent true biological sequences obtained from reads (Callahan et al, 2016). In addition, there have been recent efforts to use the occurrence of short-chain k-mer (15-30mer) (Molik et al, 2020), and very short-chain k-mers (<10mer) (Asgari et al, 2018(Asgari et al, , 2019, within reads that offer a unique reference-free and alignment-free approach to provide a data representation upon which a phenotype prediction model is built. We have included both of these k-mer approaches in our review to compare them directly with the OTU/ASV assignment methods.…”

Section: Introductionmentioning

confidence: 99%

Systematic Comparisons for Composition Profiles, Taxonomic Levels, and Machine Learning Methods for Microbiome-Based Disease Prediction

Song

Wright

Zhou

2020

Front. Mol. Biosci.

View full text Add to dashboard Cite

Microbiome composition profiles generated from 16S rRNA sequencing have been extensively studied for their usefulness in phenotype trait prediction, including for complex diseases such as diabetes and obesity. These microbiome compositions have typically been quantified in the form of Operational Taxonomic Unit (OTU) count matrices. However, alternate approaches such as Amplicon Sequence Variants (ASV) have been used, as well as the direct use of k-mer sequence counts. The overall effect of these different types of predictors when used in concert with various machine learning methods has been difficult to assess, due to varied combinations described in the literature. Here we provide an in-depth investigation of more than 1,000 combinations of these three clustering/counting methods, in combination with varied choices for normalization and filtering, grouping at various taxonomic levels, and the use of more than ten commonly used machine learning methods for phenotype prediction. The use of short k-mers, which have computational advantages and conceptual simplicity, is shown to be effective as a source for microbiome-based prediction. Among machine-learning approaches, tree-based methods show consistent, though modest, advantages in prediction accuracy. We describe the various advantages and disadvantages of combinations in analysis approaches, and provide general observations to serve as a useful guide for future trait-prediction explorations using microbiome data.

show abstract

“…For mock community analysis, metatranscriptomics provided the most reliable species diversity and community composition estimates, which closely resembled those derived from morphological data. The use of metatranscriptomics avoided the co‐detection of extra‐organismal eDNA and minimized background noise encountered during PCR‐based methods, which may cause inflated estimates of species richness and complicated taxonomic assignment of sequences, especially with the absence of good quality reference databases (Molik et al, 2020 ). Another advantage of using RNA for monitoring zooplankton is that it avoids the bias related to NUMT pseudogene contamination (Collura et al, 1996 ).…”

Section: Discussionmentioning

confidence: 99%

Using metatranscriptomics to estimate the diversity and composition of zooplankton communities

Lopez

Lin

Sato

et al. 2021

Molecular Ecology Resources

View full text Add to dashboard Cite

DNA metabarcoding is a rapid, high-resolution tool used for biomonitoring complex zooplankton communities. However, diversity estimates derived with this approach can be biased by the co-detection of sequences from environmental DNA (eDNA), nuclear-encoded mitochondrial (NUMT) pseudogene contamination, and taxonspecific PCR primer affinity differences. To avoid these methodological uncertainties, we tested the use of metatranscriptomics as an alternative approach for characterizing zooplankton communities. Specifically, we compared metatranscriptomics with PCR-based methods using genomic (gDNA) and complementary DNA (cDNA) amplicons, and morphology-based data for estimating species diversity and composition for both mock communities and field-collected samples. Mock community analyses showed that the use of gDNA mitochondrial cytochrome c oxidase I (mtCO1) amplicons inflates species richness due to the co-detection of extra-organismal eDNA.Significantly more amplicon sequence variants, nucleotide diversity, and indels were observed with gDNA amplicons than with cDNA, indicating the presence of putative NUMT pseudogenes. Moreover, PCR-based methods failed to detect the most abundant species in mock communities due to priming site mismatch. Overall, metatranscriptomics provided estimates of species richness and composition that closely resembled those derived from morphological data. The use of metatranscriptomics was further tested using field-collected samples, with the results showing consistent species diversity estimates among biological and technical replicates. Additionally, temporal zooplankton species composition changes could be monitored using different mitochondrial markers. These findings demonstrate the advantages of metatranscriptomics as an effective tool for monitoring diversity in zooplankton research.

show abstract

Uncovering Effects from the Structure of Metabarcode Sequences for Metagenetic and Microbiome Analysis

Cited by 4 publications

References 34 publications

Life barcoded by DNA barcodes

Life barcoded by DNA barcodes

Systematic Comparisons for Composition Profiles, Taxonomic Levels, and Machine Learning Methods for Microbiome-Based Disease Prediction

Using metatranscriptomics to estimate the diversity and composition of zooplankton communities

Contact Info

Product

Resources

About