2022
DOI: 10.1038/s41587-021-01156-3
|View full text |Cite
|
Sign up to set email alerts
|

SignalP 6.0 predicts all five types of signal peptides using protein language models

Abstract: Signal peptides (SPs) are short amino acid sequences that control protein secretion and translocation in all living organisms. SPs can be predicted from sequence data, but existing algorithms are unable to detect all known types of SPs. We introduce SignalP 6.0, a machine learning model that detects all five SP types and is applicable to metagenomic data.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

9
1,013
0
3

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 1,466 publications
(1,025 citation statements)
references
References 36 publications
9
1,013
0
3
Order By: Relevance
“…HHpred raw data was parsed using in-house scripts to generate the architecture of each protein in terms of non-overlapping homologous regions to 2J58 and 3P42. To identify the type of signal peptide and the cleavage site location, each OPX protein was subject to SignalP 6.0 analysis (Teufel et al ., 2022) and to predict the membrane topology of the proteins, TMHMM (Server v. 2.0) (Krogh et al ., 2001) was used.…”
Section: Methodsmentioning
confidence: 99%
“…HHpred raw data was parsed using in-house scripts to generate the architecture of each protein in terms of non-overlapping homologous regions to 2J58 and 3P42. To identify the type of signal peptide and the cleavage site location, each OPX protein was subject to SignalP 6.0 analysis (Teufel et al ., 2022) and to predict the membrane topology of the proteins, TMHMM (Server v. 2.0) (Krogh et al ., 2001) was used.…”
Section: Methodsmentioning
confidence: 99%
“…Alignment was prepared with Clustal Omega online ( https://www.ebi.ac.uk/Tools/msa/clustalo/ ; accessed January 15, 2022). The following elements are indicated, from N to C end, on SntA precursor sequence: signal peptide (gray; amino acids 1–27) as predicted by SignalP 6.0 ( Teufel et al, 2022 ); metallophos domain (green; amino acids 113–426); interdomain linker (orange; amino acids 427–446); 5_nucleotid_C domain (cyan; amino acids 447–680); and C-terminal sorting signal (gray) comprising a LPXTG motif (amino acids 783–787) and a hydrophobic domain with a positively charged tail (amino acids 792–813). SntA core used to run substrate specificity and kinetics experiments corresponds to amino acids 111–727 (shown within borders), which align well with mature CpdB.…”
Section: Introductionmentioning
confidence: 99%
“…The signal peptide of the PNX precursors is unusual. We could only detect it by SignalP3.0 ( Bendtsen et al 2004 ) and not by SignalP4.1 ( Petersen et al 2011 ), SignalP5.0 ( Almagro-Armenteros et al 2019b ), or SignalP6.0 ( Teufel et al 2022 ). These newer versions of the software failed to identify the signal peptide even in the human PNX precursor protein where propeptide cleavage has been experimentally demonstrated ( Yosten et al 2013a ) or in the PNX precursor from N. norvegicus ( Nguyen et al 2018 ).…”
Section: Resultsmentioning
confidence: 92%