2020
DOI: 10.1101/2020.07.31.230631
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

RFPlasmid: Predicting plasmid sequences from short read assembly data using machine learning

Abstract: Antimicrobial resistance (AMR) genes in bacteria are often carried on plasmids and these plasmids can transfer AMR genes between bacteria. For molecular epidemiology purposes and risk assessment, it is important to know if the genes are located on highly transferable plasmids or in the more stable chromosomes. However, draft whole genome sequences are fragmented, making it difficult to discriminate plasmid and chromosomal contigs. Current methods that predict plasmid sequences from draft genome sequences rely … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(15 citation statements)
references
References 40 publications
0
15
0
Order By: Relevance
“…Mlplasmids outperformed both cBAR and PlasFlow when classifying data derived from whole-genome sequencing experiments, and it can also accurately predict the plasmid localization of several antimicrobial resistance genes [ 29 ]. RFPlasmid [ 32 ], a recently released tool, uses a random forest classifier trained with a hybrid approach by identifying chromosomal and plasmids marker genes using two databases and also pentamer frequencies. This tool also works with metagenomic assemblies, albeit only for contigs from the 17 different species for which classifiers were trained.…”
Section: Resultsmentioning
confidence: 99%
“…Mlplasmids outperformed both cBAR and PlasFlow when classifying data derived from whole-genome sequencing experiments, and it can also accurately predict the plasmid localization of several antimicrobial resistance genes [ 29 ]. RFPlasmid [ 32 ], a recently released tool, uses a random forest classifier trained with a hybrid approach by identifying chromosomal and plasmids marker genes using two databases and also pentamer frequencies. This tool also works with metagenomic assemblies, albeit only for contigs from the 17 different species for which classifiers were trained.…”
Section: Resultsmentioning
confidence: 99%
“…All regions identified as non-core were compared with previously identified complete plasmids from an NCBI-based plasmid database [ 44 ] with BLAST [ 45 ]. In addition, RFPlasmid ( , accessed on 12 April 2021) was used to predict whether a contig in each assembly was likely part of a plasmid [ 46 ].…”
Section: Methodsmentioning
confidence: 99%
“…Circular maps were drawn with CGView Server ( (accessed on 18 March 2021)) [ 35 ]. The contigs did not include plasmid-associated contigs identified via RFPlasmid ( (accessed on 18 March 2021)) [ 33 ], were re-ordered with Mauve Contig Mover [ 34 ], and were manually inspected as described in Materials and Methods. The genomes were annotated with Prokka [ 32 ], yielding genetic features that are shown in the circular maps as rectangular boxes in the two outermost circles separated by the dark gray line.…”
Section: Figurementioning
confidence: 99%
“…The contigs were annotated with Prokka (version 1.14.5; included in the Docker image available at https://hub.docker.com/r/staphb/prokka (accessed on 18 March 2021)) [32]. Plasmid-borne contigs were identified using RFPlasmid (http://klif.uu.nl/rfplasmid/ (accessed on 18 March 2021)) [33] and eliminated from further analysis. The plasmid-free contigs were re-ordered with Mauve Contig Mover included in Mauve (version 20150226 build 10 (c)) [34] against the complete genome of L. welshimeri SLCC5334 (accession no.…”
Section: Whole Genome Sequencing (Wgs) and Analysismentioning
confidence: 99%
See 1 more Smart Citation