2018
DOI: 10.1186/s40168-018-0470-z
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin

Abstract: BackgroundTaxonomic classification of marker-gene sequences is an important step in microbiome analysis.ResultsWe present q2-feature-classifier (https://github.com/qiime2/q2-feature-classifier), a QIIME 2 plugin containing several novel machine-learning and alignment-based methods for taxonomy classification. We evaluated and optimized several commonly used classification methods implemented in QIIME 1 (RDP, BLAST, UCLUST, and SortMeRNA) and several new methods implemented in QIIME 2 (a scikit-learn naive Baye… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

5
2,576
0
15

Year Published

2019
2019
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 3,968 publications
(2,596 citation statements)
references
References 38 publications
5
2,576
0
15
Order By: Relevance
“…However, the lack of accurate and easily customizable bioinformatic pipelines is limiting the broader application of eDNA approaches. The Anacapa Toolkit provides enhanced functionality for eDNA projects and can be used for other common applications such as gut content analysis (Leray et al, ), autonomous reef monitoring structures (Ransome et al, ), and microbiomes (Bokulich et al, ). Importantly, the Anacapa Toolkit is modular and its parameters are easily modifiable, making it easily adapted to user specific needs in several important ways.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, the lack of accurate and easily customizable bioinformatic pipelines is limiting the broader application of eDNA approaches. The Anacapa Toolkit provides enhanced functionality for eDNA projects and can be used for other common applications such as gut content analysis (Leray et al, ), autonomous reef monitoring structures (Ransome et al, ), and microbiomes (Bokulich et al, ). Importantly, the Anacapa Toolkit is modular and its parameters are easily modifiable, making it easily adapted to user specific needs in several important ways.…”
Section: Resultsmentioning
confidence: 99%
“…Importantly, the Anacapa Toolkit is modular and its parameters are easily modifiable, making it easily adapted to user specific needs in several important ways. First, CRUX reference databases are compatible with alternative classifiers (Bokulich et al, ), and users can append their own reference sequences to CRUX databases as needed. Second, the Q uality Control and ASV Parsing module is designed to process pooled metabarcoding libraries and automatically sort them by barcode and sample.…”
Section: Resultsmentioning
confidence: 99%
“…The resulting sequences were stored in a fastq file generated for each sample and processed using QIIME2 v2018·11 software, importing data through a manifest file. Then, single‐end raw reads were filtered, quality retained for quality score > 20 and chimeras were removed using DADA2 v2018·4 .…”
Section: Methodsmentioning
confidence: 99%
“…The α‐ and β‐diversity metrics were estimated from a subsampled ASV dataset, with 45,000 sequences per sample (Table S2). Each ASV was identified using a Naïve Bayes classifier trained on 16S rRNA gene sequences from the Greengenes release 13_8 dataset (Bokulich et al, ; Table S3). Principal coordinate analysis and other statistical analyses were performed using custom R and Python scripts.…”
Section: Methodsmentioning
confidence: 99%