2022
DOI: 10.1128/msystems.01180-21
|View full text |Cite
|
Sign up to set email alerts
|

PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest

Abstract: Antimicrobial resistance is a global health threat to humans and animals, causing high mortality and morbidity while effectively ending decades of success in fighting against bacterial infections. Plasmids confer extra genetic capabilities to the host organisms through accessory genes that can encode antimicrobial resistance and virulence.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(12 citation statements)
references
References 40 publications
0
12
0
Order By: Relevance
“…We compared HOTSPOT with three state-of-the-art tools designed for plasmid host prediction: MOB-typer ( Robertson et al 2020 ), PlasmidHostFinder ( Aytan-Aktug et al 2022 ), and PlasFlow ( Krawczyk et al 2018 ), all of which are briefly described in Section 1.1. PlasFlow can only output phylum-level results.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We compared HOTSPOT with three state-of-the-art tools designed for plasmid host prediction: MOB-typer ( Robertson et al 2020 ), PlasmidHostFinder ( Aytan-Aktug et al 2022 ), and PlasFlow ( Krawczyk et al 2018 ), all of which are briefly described in Section 1.1. PlasFlow can only output phylum-level results.…”
Section: Resultsmentioning
confidence: 99%
“…They mainly adopt k -mers, GC-content, and codon usage as features. For example, PlasmidHostFinder ( Aytan-Aktug et al 2022 ) utilizes plasmid genomic signatures, such as 8-mer counts and codon frequencies, to train four random forest models corresponding to the order, family, genus, and species level. PlasFlow ( Krawczyk et al 2018 ) trained neural networks using normalized k -mer frequency vector to predict phylum-level plasmid hosts.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Plasmid hosts, on the other hand, due to the competence of plasmids (including self-replication, etc. ), are difficult to identify ( Suzuki et al, 2010 ) or can only be classified against existing databases ( Aytan-Aktug et al, 2022 ).…”
Section: Methodsmentioning
confidence: 99%
“…However, these tools can perform poorly with contigs assembled from metagenomes because they are not designed for identifying incomplete sequences ( 4 , 12 15 ). On the other hand, machine-learning-based approaches can be designed for identifying extrachromosomal elements using short sequence-based matching, or by identifying sequence characteristics such as GC content, codon usage, and gene density ( 12 , 14 , 16 , 17 ). Machine-learning models developed for both WGS and metagenomic reads use various algorithms such as support vector machines ( 18 ), logistic regressions ( 12 ), random forests ( 19 ), and artificial neural networks ( 14 , 15 , 20 23 ).…”
Section: Introductionmentioning
confidence: 99%