2018
DOI: 10.1186/s13059-018-1568-0
|View full text |Cite
|
Sign up to set email alerts
|

KrakenUniq: confident and fast metagenomics classification using unique k-mer counts

Abstract: False-positive identifications are a significant problem in metagenomics classification. We present KrakenUniq, a novel metagenomics classifier that combines the fast k-mer-based classification of Kraken with an efficient algorithm for assessing the coverage of unique k-mers found in each species in a dataset. On various test datasets, KrakenUniq gives better recall and precision than other methods and effectively classifies and distinguishes pathogens with low abundance from false positives in infectious dise… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
351
1
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 348 publications
(355 citation statements)
references
References 30 publications
2
351
1
1
Order By: Relevance
“…Ten samples from each of MSBB_WES (syn7541077), MSBB_RNA (syn8612191) and MAYO_TCX (syn8612203) datasets with the greatest number of reported HHV6A (n=5/dataset) and HHV7 reads (n=5/dataset) were selected for further analysis (n=30 total). Raw reads were preprocessed with fastp, then taxonomically categorized using KrakenUniq, a fast yet highly sensitive method based on k-mers (Breitwieser et al, 2018;Chen et al, 2018). KrakenUniq identified a total of 13 HHV6A reads in 2/15 top HHV6A samples (Readhead total: 75 reads), and failed to identify any HH7 reads in the top HHV7 subset (Readhead total: 93 reads in 15 samples).…”
Section: Methods and Resultsmentioning
confidence: 99%
“…Ten samples from each of MSBB_WES (syn7541077), MSBB_RNA (syn8612191) and MAYO_TCX (syn8612203) datasets with the greatest number of reported HHV6A (n=5/dataset) and HHV7 reads (n=5/dataset) were selected for further analysis (n=30 total). Raw reads were preprocessed with fastp, then taxonomically categorized using KrakenUniq, a fast yet highly sensitive method based on k-mers (Breitwieser et al, 2018;Chen et al, 2018). KrakenUniq identified a total of 13 HHV6A reads in 2/15 top HHV6A samples (Readhead total: 75 reads), and failed to identify any HH7 reads in the top HHV7 subset (Readhead total: 93 reads in 15 samples).…”
Section: Methods and Resultsmentioning
confidence: 99%
“…We compared ganon against kraken (Wood and Salzberg, 2014), one of the most used k-mer based methods for metagenomics short read classification and its newer version, kraken2 (Wood et al, 2019). We also included krakenuniq (Breitwieser et al, 2018), which uses the basic kraken algorithm and also allows classification on more specific levels after taxonomic assignments (e.g. up to assembly or sequence level).…”
Section: Resultsmentioning
confidence: 99%
“…Data visualization. We created the Sankey plots using the krakenuniq-report tool from KrakenUniq [30] to create a Kraken-style report from our predicted contaminations. The visualization was done using Pavian [31] extracted as SVG and colored by Inkscape.…”
Section: Methodsmentioning
confidence: 99%