2022
DOI: 10.3389/fmicb.2022.1042127
|View full text |Cite
|
Sign up to set email alerts
|

Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains

Abstract: Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset consists of 741 sigma70 promoters and 1,400 non-promoters. We have generated a wide range of features around 8,000, which includes Dinucleotide Auto-Correlation, Dinucleotide Cross-Correlation, Dinucleotide Auto Cross-C… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 36 publications
0
3
0
Order By: Relevance
“…A σ70 promoter should harbor two well-defined short DNA elements that are situated at about 10 bps and 35 bps upstream of the TSS (the −10 element and the −35 element, respectively) [19]. The consensus sequences are TTGACA (for −35) and TATAAT (for −10), and the distances between these two elements are usually 17 to 18 bps [20]. Based on these features, we proposed two short sequences, "TTGtgA" (−96 to −91 relative to the fucA start codon; consensus nucleotides are capitalized) and "aATtAa" (−73 to −68) upstream of the TSS, to be the −35 element and the −10 element, respectively (Figure 4B).…”
Section: Determination Of the Transcriptional Start Site (Tss) For Th...mentioning
confidence: 99%
“…A σ70 promoter should harbor two well-defined short DNA elements that are situated at about 10 bps and 35 bps upstream of the TSS (the −10 element and the −35 element, respectively) [19]. The consensus sequences are TTGACA (for −35) and TATAAT (for −10), and the distances between these two elements are usually 17 to 18 bps [20]. Based on these features, we proposed two short sequences, "TTGtgA" (−96 to −91 relative to the fucA start codon; consensus nucleotides are capitalized) and "aATtAa" (−73 to −68) upstream of the TSS, to be the −35 element and the −10 element, respectively (Figure 4B).…”
Section: Determination Of the Transcriptional Start Site (Tss) For Th...mentioning
confidence: 99%
“…An 81-nucleotide long sequence containing the end of SO_1320 and the first eight nucleotides of yadS were used as the query. (Lin et al, 2014;Patiyal et al, 2022). Amino acid sequences of WT and mutant versions of SO_3758 were aligned to the sodiumdependent bicarbonate transporter SbtA from cyanobacteria Synechocystis sp.…”
Section: Bioinformaticsmentioning
confidence: 99%
“…19,20 Multiple machine learning and deep learning methods have been developed for bacterial promoter prediction, including iPromoter-2L, 21 G4PromFinder, 22 iPro70-FMWin, 23 MUL-TiPLy, 24 iProEP, 25 SELECTOR, 26 CNNProm, 26 iPromoter-BnCNN, 27 Promotech, 28 iPro-GAN, 29 PromoterLCNN, 30 iPromoter-CLA, 31 iPro-WAEL, 32 and Sigma70Pred. 33 These methods employ various algorithms and feature encoding schemes to identify promoters and their specific types in different bacterial species. On the other side, bioinformatic tools that utilize the structural architecture of promoters, such as PromPredict, 34 BTSS Finder, 35 and BacPP, 36 leverage structural architecture of DNA and have shown great promise in distinguishing promoters from nonpromoter regions.…”
Section: ■ Introductionmentioning
confidence: 99%
“…However, alternative approaches that consider structural properties hold promise for this task, as they provide discernible biological signals in the context of transcription initiation mechanisms. Machine learning techniques utilizing DNA duplex stability and base stacking energy of promoter, intergenic and transcribed sequences can be compared to demonstrate the unique features of the core promoter providing a high-scale annotation of these essential genetic regions. , Multiple machine learning and deep learning methods have been developed for bacterial promoter prediction, including iPromoter-2L, G4PromFinder, iPro70-FMWin, MUL-TiPLy, iProEP, SELECTOR, CNNProm, iPromoter-BnCNN, Promotech, iPro-GAN, PromoterLCNN, iPromoter-CLA, iPro-WAEL, and Sigma70Pred . These methods employ various algorithms and feature encoding schemes to identify promoters and their specific types in different bacterial species.…”
Section: Introductionmentioning
confidence: 99%