2022
DOI: 10.1038/s41467-022-29843-y
|View full text |Cite
|
Sign up to set email alerts
|

A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments

Abstract: Metagenomic binning is the step in building metagenome-assembled genomes (MAGs) when sequences predicted to originate from the same genome are automatically grouped together. The most widely-used methods for binning are reference-independent, operating de novo and enable the recovery of genomes from previously unsampled clades. However, they do not leverage the knowledge in existing databases. Here, we introduce SemiBin, an open source tool that uses deep siamese neural networks to implement a semi-supervised … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
91
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 81 publications
(93 citation statements)
references
References 68 publications
2
91
0
Order By: Relevance
“…SemiBin (1.0.2) [37] was run using the mode with and default parameters otherwise. For the single sample binning the global model was used, except for the CAMI 2 Gastrointestinal tract samples, for which was used and the CAMI 2 Oral samples, for which was used.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…SemiBin (1.0.2) [37] was run using the mode with and default parameters otherwise. For the single sample binning the global model was used, except for the CAMI 2 Gastrointestinal tract samples, for which was used and the CAMI 2 Oral samples, for which was used.…”
Section: Methodsmentioning
confidence: 99%
“…It evaluates clusters based on the presence of lineage-specific single copy marker genes [23]. We benchmarked binny against six CAMI [17,18] data sets and compared the results with the most popular binning methods MetaBAT2 [15], MaxBin2 [14], CONCOCT [13], and the recently developed VAMB [36], SemiBin [37], and MetaDecoder [38]. We evaluated the contribution of binny to automatic MAG refinement using MetaWRAP [27] and DAS Tool [28].…”
Section: Introductionmentioning
confidence: 99%
“…N 50 and L 50 values were 164,736 bp and 628, respectively. Contig binning was performed with MetaBAT v2.12.1 ( 10 ) and SemiBin v0.5.0 ( 11 ) (parameters: ‐‐environment human_gut). Results from both tools were combined with the bin_refinement module implemented in metaWRAP v1.3.2 ( 12 ).…”
Section: Announcementmentioning
confidence: 99%
“…Our strategy is conceptually different from most other comparable methods based on k -mers, which can be broadly divided into the following categories: Methods using specific, possibly very long k -mers to detect the presence or absence of specific sequences, as in Mykrobe [25]. Methods using all k -mers with a low value of k = 4 or 5 and relying on a dimensional reduction in order to cluster and visualise sequences in an abstract small-dimensional space (“binning”, see for instance [26, 27]). Methods using simplified signatures based on some dataset-independent choices (for instance, k -mer “minimizers” as in Mash [28]) to keep signatures (“sketches”) small. Methods such as PopPUNK [29] that use distances, possibly derived from sketches, to summarise the relationships between sequences. …”
Section: Introductionmentioning
confidence: 99%
“…Methods using all k -mers with a low value of k = 4 or 5 and relying on a dimensional reduction in order to cluster and visualise sequences in an abstract small-dimensional space (“binning”, see for instance [26, 27]).…”
Section: Introductionmentioning
confidence: 99%