2016
DOI: 10.12688/f1000research.8986.2
|View full text |Cite
|
Sign up to set email alerts
|

Bioconductor Workflow for Microbiome Data Analysis: from raw reads to community analyses

Abstract: High-throughput sequencing of PCR-amplified taxonomic markers (like the 16S rRNA gene) has enabled a new level of analysis of complex bacterial communities known as microbiomes. Many tools exist to quantify and compare abundance levels or OTU composition of communities in different conditions. The sequencing reads have to be denoised and assigned to the closest taxa from a reference database. Common approaches use a notion of 97% similarity and normalize the data by subsampling to equalize library sizes. In th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
512
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 706 publications
(515 citation statements)
references
References 31 publications
3
512
0
Order By: Relevance
“…Recently, nonclustering strategies (e.g., DADA2 and UNOISE3) were held in esteem in numerous studies to generate ZOTUs (Hugerth & Andersson, ), as these methods improved resolution on biodiversity assessment, which was a cornerstone for ecological studies (Callahan et al., ; Edgar, ). Interestingly, we recapitulated the results of nonclustering methods when we explored community–environment relationship by the use of clustering thresholds of 99%–97% in the present study.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…Recently, nonclustering strategies (e.g., DADA2 and UNOISE3) were held in esteem in numerous studies to generate ZOTUs (Hugerth & Andersson, ), as these methods improved resolution on biodiversity assessment, which was a cornerstone for ecological studies (Callahan et al., ; Edgar, ). Interestingly, we recapitulated the results of nonclustering methods when we explored community–environment relationship by the use of clustering thresholds of 99%–97% in the present study.…”
Section: Discussionmentioning
confidence: 99%
“…In the DADA2 workflow, sequence reads were filtered using the standard filtering parameters (i.e., maxN = 0, truncQ = 2, rm.phix = TRUE and maxEE = 2) in the online pipeline suggested by Callahan et al. () and trimmed to 225 bp as suggested by our previous study (Xiong et al., ). Then error model parameter learning was carried out on the sequences passing the filtering step.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Trimmed (raw) reads were then processed using the R language environment v.3.5.0 and RStudio v.1.1.447, following the DADA2 v.1.8.0 workflow described by Callahan et al . Applied pipeline settings were explained in detail, before .…”
Section: Methodsmentioning
confidence: 99%