2018
DOI: 10.7717/peerj.4892
|View full text |Cite
|
Sign up to set email alerts
|

BLAST-based validation of metagenomic sequence assignments

Abstract: When performing bioforensic casework, it is important to be able to reliably detect the presence of a particular organism in a metagenomic sample, even if the organism is only present in a trace amount. For this task, it is common to use a sequence classification program that determines the taxonomic affiliation of individual sequence reads by comparing them to reference database sequences. As metagenomic data sets often consist of millions or billions of reads that need to be compared to reference databases c… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(14 citation statements)
references
References 38 publications
0
14
0
Order By: Relevance
“…Therefore, there is a trade-off between sensitivity, accuracy, performance, speed, and computational requirements. Although these types of programs are useful in performing “first-pass” taxonomic analyses it is crucial that their assignments are validated using BLAST-based procedures [ 83 ]. The absolute majority of the transcripts were of plant origin, most hitting legume sequences as identified by DIAMOND-NCBI(NR) analysis.…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, there is a trade-off between sensitivity, accuracy, performance, speed, and computational requirements. Although these types of programs are useful in performing “first-pass” taxonomic analyses it is crucial that their assignments are validated using BLAST-based procedures [ 83 ]. The absolute majority of the transcripts were of plant origin, most hitting legume sequences as identified by DIAMOND-NCBI(NR) analysis.…”
Section: Discussionmentioning
confidence: 99%
“…Remaining reads were mapped on viral RefSeq database (downloaded in March 2015) to determine the relative abundance of anelloviruses within the plasma virome. As recommended, a BLAST-based validation of metagenomic sequence assignments was performed for anelloviruses [ 11 ]. Thus, nonhuman NGS reads were aligned using BLAST on a manually curated database composed of the 56 reference sequences of human anelloviruse species as established by the International Committee on Taxonomy of Viruses (ICTV, TTV-1 to TTV-29, TTMV-1 to TTMV-12, TTMDV-1 and TTMDV-15).…”
Section: Methodsmentioning
confidence: 99%
“…Seventy-five percent of ASV yielded firstpass satisfactory taxonomic assignment with > 95% homology with their first-hit assigned sequence. The remaining 25% of ASV were blasted again against the next hit proposed by Kraken2 until a higher homology score was reached in a method inspired by [94]. After these iterative steps, we assigned up to 85% of all ASV with a homology score above 95%.…”
Section: Richness and Diversitymentioning
confidence: 99%