2020
DOI: 10.1038/s41598-020-63424-7
|View full text |Cite
|
Sign up to set email alerts
|

gammaBOriS: Identification and Taxonomic Classification of Origins of Replication in Gammaproteobacteria using Motif-based Machine Learning

Abstract: The biology of bacterial cells is, in general, based on information encoded on circular chromosomes. Regulation of chromosome replication is an essential process that mostly takes place at the origin of replication (oriC), a locus unique per chromosome. Identification of high numbers of oriC is a prerequisite for systematic studies that could lead to insights into oriC functioning as well as the identification of novel drug targets for antibiotic development. Current methods for identifying oriC sequences rely… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 66 publications
0
11
0
Order By: Relevance
“…As shown in Fig. 6 , there are big differences in terms of mononucleotide composition, dinucleotide composition and trinucleotide composition between the flanking sequences of eukaryotic ORIs and bacterial ORIs, which may be the reason for the different findings of the recent publication 51 and this study. The flanking sequences of bacterial ORIs constructed by Sperlea et al 51 can be available in the supplementary file.…”
Section: Results and Disccusionmentioning
confidence: 68%
See 1 more Smart Citation
“…As shown in Fig. 6 , there are big differences in terms of mononucleotide composition, dinucleotide composition and trinucleotide composition between the flanking sequences of eukaryotic ORIs and bacterial ORIs, which may be the reason for the different findings of the recent publication 51 and this study. The flanking sequences of bacterial ORIs constructed by Sperlea et al 51 can be available in the supplementary file.…”
Section: Results and Disccusionmentioning
confidence: 68%
“…In the recent publication 51 , RFs trained on word2vec-derived encodings show unsatisfactory performance on bacterial ORIs. In this study, the word2vec combined with 2D-CNN achieves excellent identification results on eukaryotic ORIs.…”
Section: Results and Disccusionmentioning
confidence: 99%
“…To determine if the studied genetic elements in H. halochloris are transcribed from leading or lagging strands, we searched for the origin of replication (OriC). GammaBOriS tool [30] [20].…”
Section: Resultsmentioning
confidence: 99%
“…In the case of group IIC introns, the formation of secondary structures is crucial for insertion [9]. Based on GammaBOriS [30] identi cation of the origin of replication in H. halochloris, H.ha.F1 and H.ha.2 are inserted within the leading strand rather than the lagging strand; a documented yet rare phenomena [1]. Furthermore, despite the above mentioned reliance of En -IEPs group II introns on host replication machinery for complete retrohoming and retrotransposition, a possible minor retrohoming pathway independent of DNA replication can exist, at which introns can reverse splice into double stranded (ds) or transiently ssDNA target sites [1].…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation