A fungal mock community control for amplicon sequencing experiments

Bakker, Matthew G.

doi:10.1111/1755-0998.12760

Cited by 67 publications

(89 citation statements)

References 54 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Besides Archaeorhizomycetes, the ITS9munngs + TW13 primer pair tended to overyield Rhizaria and Sordariomycetes and other groups with short sequences but discriminated against Metazoa, Chytridiomycota and Agaricomycetes with relatively long ITS sequences. Thus, our results confirm identification biases in Illumina MiSeq and PacBio RSII platforms against taxa with longer ITS sequences (Bakker ; Tedersoo et al . , ).…”

Section: Resultssupporting

confidence: 86%

Towards PacBio‐based pan‐eukaryote metabarcoding using full‐length ITS sequences

Tedersoo

Anslan

2019

Environ Microbiol Rep

View full text Add to dashboard Cite

Summary Development of high‐throughput sequencing techniques has greatly benefited our understanding about microbial ecology, yet the methods producing short reads suffer from species‐level resolution and uncertainty of identification. Here, we optimize Pacific Biosciences‐based metabarcoding protocols covering the internal transcribed spacer (ITS region) and partial small subunit of the rRNA gene for species‐level identification of all eukaryotes, with a specific focus on Fungi (including Glomeromycota) and Stramenopila (particularly Oomycota). Based on tests on composite soil samples and mock communities, we propose best suitable degenerate primers, ITS9munngs + ITS4ngsUni for eukaryotes and selected groups therein and discuss the pros and cons of long read‐based identification of eukaryotes.

show abstract

Section: Resultssupporting

confidence: 86%

Towards PacBio‐based pan‐eukaryote metabarcoding using full‐length ITS sequences

Tedersoo

Anslan

2019

Environ Microbiol Rep

View full text Add to dashboard Cite

show abstract

“…Thus, the interpretation of internal transcribed spacer (ITS) sequencing data as semi‐quantitative (i.e. correctly representing relative abundances of members) in fungal community studies has been questioned based on observations of biases and errors when sequencing artificially assembled (‘mock’) communities (Amend et al ., 2010; Bakker, 2018; Palmer et al ., 2018). Such biases and errors affect perceived differences in composition and diversity among communities, and should be minimized (Frøslev et al ., 2017; Nilsson et al ., 2019).…”

Section: Introductionmentioning

confidence: 99%

Optimized metabarcoding with Pacific biosciences enables semi‐quantitative analysis of fungal communities

et al. 2020

View full text Add to dashboard Cite

Recent studies have questioned the use of high-throughput sequencing of the nuclear ribosomal internal transcribed spacer (ITS) region to derive a semi-quantitative representation of fungal community composition. However, comprehensive studies that quantify biases occurring during PCR and sequencing of ITS amplicons are still lacking. We used artificially assembled communities consisting of 10 ITS-like fragments of varying lengths and guanine-cytosine (GC) contents to evaluate and quantify biases during PCR and sequencing with Illumina MiSeq, PacBio RS II and PacBio Sequel I technologies. Fragment length variation was the main source of bias in observed community composition relative to the template, with longer fragments generally being under-represented for all sequencing platforms. This bias was three times higher for Illumina MiSeq than for PacBio RS II and Sequel I. All 10 fragments in the artificial community were recovered when sequenced with PacBio technologies, whereas the three longest fragments (> 447 bases) were lost when sequenced with Illumina MiSeq. Fragment length bias also increased linearly with increasing number of PCR cycles but could be mitigated by optimization of the PCR setup. No significant biases related to GC content were observed. Despite lower sequencing output, PacBio sequencing was better able to reflect the community composition of the template than Illumina MiSeq sequencing.

show abstract

“…To demonstrate dadasnake's potential to accurately determine community composition and richness, two mock community datasets from Illumina sequencing of bacterial [36] and fungal [37] DNA were analysed. In both cases, the genus-level composition was determined mostly correctly ( Figure 2 a&b; supplementary table 2).…”

Section: Use Casementioning

confidence: 99%

“…The ITS2 region of a fungal mock community [37] was amplified using the primers F-ITS4 5-TCCTCCGCTTATTGATATGC [45] and R-fITS7 5-GTGARTCATCGAATCTTTG [46] modified with heterogeneity spacers according to [47]. Amplicon libraries were prepared using the Nextera XT kit (Illumina) and sequenced on an Illumina MiSeq with v.3 chemistry at 2 x 300 bp.…”

Section: Fungal Mock Community Sequencingmentioning

confidence: 99%

dadasnake, a Snakemake implementation of DADA2 to process amplicon sequencing data for microbial ecology

Weißbecker

Schnabel

Heintz‐Buschart

2020

Preprint

View full text Add to dashboard Cite

Background: Amplicon sequencing of phylogenetic marker genes, e.g. 16S, 18S or ITS rRNA sequences, is still the most commonly used method to estimate the structure of microbial communities. Microbial ecologists often have expert knowledge on their biological question and data analysis in general, and most research institutes have computational infrastructures to employ the bioinformatics command line tools and workflows for amplicon sequencing analysis, but requirements of bioinformatics skills often limit the efficient and up-to-date use of computational resources. Results: dadasnake wraps pre-processing of sequencing reads, delineation of exact sequencing variants using the favorably benchmarked, widely-used the DADA2 algorithm, taxonomic classification and post-processing of the resultant tables, and hand-off in standard formats, into a userfriendly, one-command Snakemake pipeline. The suitability of the provided default configurations is demonstrated using mock-community data from bacteria and archaea, as well as fungi. Conclusions: By use of Snakemake, dadasnake makes efficient use of high-performance computing infrastructures. Easy user configuration guarantees flexibility of all steps, including the processing of data from multiple sequencing platforms. dadasnake facilitates easy installation via conda environments. dadasnake is available at https://github.com/a-h-b/dadasnake . Findings BackgroundSince the first reports 15 years ago [1], high-throughput amplicon sequencing has become the most common approach to monitor microbial diversity in environmental samples. Sequencing preparation, throughput and precision have been consistently improved, while costs have decreased. Computational methods have been refined in the recent years, especially with the shift to exact sequencing variants and better use of sequence quality data [2,3]. While amplicon sequencing can have severe limitations, such as limited and uneven taxonomic resolution [4,5], over-and underestimation of diversity [6,7], lack of quantitative value [8,9] and missing functional information, amplicon sequencing is still considered the method of choice to gain an overview of microbial diversity in a large number of samples [10,11]. Consequently, the sizes of typical amplicon sequencing datasets have grown. In addition, synthesis efforts are undertaken, requiring efficient processing pipelines for amplicon sequencing data [12]. Due to the unique, microbiome-specific characteristics of each dataset and the need to integrate the community structure data with other data types, such as abiotic or biotic parameters, users of data processing tools need to have expert knowledge on their biological question and statistics. It is therefore desirable that workflows should be as user-friendly as possible. Several widely used workflows exist e.g. qiime2 [13], mothur [14], usearch [15], lOTUs [16], with new approaches continually being developed, e.g. OCToPUS [17], PEMA [18], typically balancing learning curves, configurability and efficiency.Purpose of dadasna...

show abstract

A fungal mock community control for amplicon sequencing experiments

Cited by 67 publications

References 54 publications

Towards PacBio‐based pan‐eukaryote metabarcoding using full‐length ITS sequences

Towards PacBio‐based pan‐eukaryote metabarcoding using full‐length ITS sequences

Optimized metabarcoding with Pacific biosciences enables semi‐quantitative analysis of fungal communities

dadasnake, a Snakemake implementation of DADA2 to process amplicon sequencing data for microbial ecology

Contact Info

Product

Resources

About