2022
DOI: 10.3389/fmicb.2021.813094
|View full text |Cite
|
Sign up to set email alerts
|

T1SEstacker: A Tri-Layer Stacking Model Effectively Predicts Bacterial Type 1 Secreted Proteins Based on C-Terminal Non-repeats-in-Toxin-Motif Sequence Features

Abstract: Type 1 secretion systems play important roles in pathogenicity of Gram-negative bacteria. However, the substrate secretion mechanism remains largely unknown. In this research, we observed the sequence features of repeats-in-toxin (RTX) proteins, a major class of type 1 secreted effectors (T1SEs). We found striking non-RTX-motif amino acid composition patterns at the C termini, most typically exemplified by the enriched “[FLI][VAI]” at the most C-terminal two positions. Machine-learning models, including deep-l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 45 publications
(72 reference statements)
0
4
0
Order By: Relevance
“…No published multiclass prediction tool has considered all 5 major types (I to IV and VI) of secreted substrates of Gram-negative bacteria. Here, we selected 7 state-of-the-art binary predictors (T1SEstacker [ 15 ] for T1SE; BEAN 2.0 [ 17 ], Bastion3 [ 18 ], T3SEpp [ 19 ], and EP3 [ 21 ] for T3SE; Bastion4 [ 27 ], CNN-T4SE [ 28 ], iT4SE-EP [ 30 ], and T4SEfinder [ 31 ] for T4SE; and Bastion6 [ 32 ] for T6SE) for performance comparison. We used DeepSecE to predict the protein sequences in independent datasets and recorded the classification metrics.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…No published multiclass prediction tool has considered all 5 major types (I to IV and VI) of secreted substrates of Gram-negative bacteria. Here, we selected 7 state-of-the-art binary predictors (T1SEstacker [ 15 ] for T1SE; BEAN 2.0 [ 17 ], Bastion3 [ 18 ], T3SEpp [ 19 ], and EP3 [ 21 ] for T3SE; Bastion4 [ 27 ], CNN-T4SE [ 28 ], iT4SE-EP [ 30 ], and T4SEfinder [ 31 ] for T4SE; and Bastion6 [ 32 ] for T6SE) for performance comparison. We used DeepSecE to predict the protein sequences in independent datasets and recorded the classification metrics.…”
Section: Resultsmentioning
confidence: 99%
“…In silico methods, particularly machine-learning-based approaches, have been developed and used to identify and distinguish secreted substrates from nonsecreted proteins [ 3 , 13 ]. For instance, some tools identify different types of secreted proteins, including type I secreted proteins (T1SEs) [ 14 , 15 ], T3SEs [ 16 22 ], T4SEs [ 23 31 ], and T6SEs [ 32 , 33 ]. Position-specific scoring matrix (PSSM)-based methods, such as Bastion3 [ 18 ], convolutional neural network (CNN)-T4SE [ 28 ], and Bastion6 [ 32 ], usually achieve a sound predictive performance.…”
Section: Introductionmentioning
confidence: 99%
“…The raw and frameshifting peptide sequences were predicted with T4SEpre (Wang et al, 2014), T1SEstacker (Chen et al, 2022) and SignalP 6.0 (Teufel et al, 2022) respectively.…”
Section: Low Tolerance Of Type IV Type I and Sec/tat Secretion Signal...mentioning
confidence: 99%
“…The C-terminal 100-aa peptides were retrieved from the raw and frameshift-mutated type IV effectors, and predicted by T4SEpre with a cutoff of 3/3, that is, positive prediction by all the three modules, T4SEpre_bpbaac, T4SEpre_pseaac and T4SEpre_joint (Wang et al, 2014). T1SEstacker was used to predict the C-terminal 60-aa fragments of the raw and frameshift-mutated type I secreted proteins, with a default cutoff (>=3/6) (Chen et al, 2022). For the Sec/Tat signal peptides, we used SignalP 6.0 and the default settings to screen all possible Sec/Tat substrates from the raw and frameshift-mutated proteome of E. coli MG1655, S. typhimurium 14028S, Y. entericolitica 8081 and R. solanacearum GMI1000 (Teufel et al, 2022).…”
Section: Tolerance Of Frameshift Mutations In Mrna Sequences Encoding...mentioning
confidence: 99%