2022
DOI: 10.1101/2022.04.27.489679
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Selection of software and database for metagenomics sequence analysis impacts the outcome of microbial profiling and pathogen detection

Abstract: AimShotgun metagenomic sequencing analysis is widely used for microbial profiling of biological specimens and pathogen detection. However, very little is known about the technical biases caused by the choice of analysis software and databases. In this study, we evaluated shotgun metagenomics taxonomical profiling software to characterize the microbial compositions of biological samples collected from wild rodents.Method and ResultsUsing nine of the most widely used metagenomics software and four different data… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 65 publications
0
4
0
Order By: Relevance
“…Although still widely used, MetaPhlAn3 [53][54][55] (along with its previous versions [35-37, 39-42, 50-52], see Supplementary Table S2), did not withstand the competition with more recent tools. Its primary handicap stemmed from its highly constricted native feature space, resulting in a substantial number of false negatives, thereby impairing all performance metrics.…”
Section: Discussionmentioning
confidence: 98%
“…Although still widely used, MetaPhlAn3 [53][54][55] (along with its previous versions [35-37, 39-42, 50-52], see Supplementary Table S2), did not withstand the competition with more recent tools. Its primary handicap stemmed from its highly constricted native feature space, resulting in a substantial number of false negatives, thereby impairing all performance metrics.…”
Section: Discussionmentioning
confidence: 98%
“…Although still widely used, MetaPhlAn3 [51][52][53] (along with its previous versions [35-37, 39-42, 48-50], see Supplementary Table S2), did not withstand the competition with more recent tools. Its primary handicap stemmed from its highly constricted native feature space, resulting in a substantial number of false negatives, thereby impairing all performance metrics.…”
Section: Discussionmentioning
confidence: 98%
“…Accordingly, the National Center for Biotechnology Information (NCBI) hosts GenBank, which contains a comprehensive collection of genetic sequence data, including DNA, RNA, and protein sequences submitted by researchers from around the globe (17). The NCBI offers up-to-date non-redundant protein and nucleotide databases (18), which seem to be the most suitable reference databases for analyzing shotgun sequences from environmental DNA samples (19). However, due to their comprehensiveness and regular updates, there are two major concerns.…”
Section: Introductionmentioning
confidence: 99%