2023
DOI: 10.1021/acs.jproteome.2c00667
|View full text |Cite
|
Sign up to set email alerts
|

LMPhosSite: A Deep Learning-Based Approach for General Protein Phosphorylation Site Prediction Using Embeddings from the Local Window Sequence and Pretrained Protein Language Model

Abstract: Phosphorylation is one of the most important post-translational modifications and plays a pivotal role in various cellular processes. Although there exist several computational tools to predict phosphorylation sites, existing tools have not yet harnessed the knowledge distilled by pretrained protein language models. Herein, we present a novel deep learning-based approach called LMPhosSite for the general phosphorylation site prediction that integrates embeddings from the local window sequence and the contextua… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(14 citation statements)
references
References 57 publications
0
14
0
Order By: Relevance
“…These predictors traditionally use one-hot encoded representations, with a combination of a CNN module and a capsule network module (MusiteDeep 18 ), three densely connected CNN blocks (DeepPhos 19 ), or a local context module and a global module built from convolutional and LSTM layers, and a squeeze-and-excitation network (DeepPSP 13 ). Finally, LMPhosSite 28 uses a CNN module on a trainable embedding, in combination with a fully-connected neural network built on the pLM representation for the single phosphosite position. For more information, see the Methods section, and their respective publications.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…These predictors traditionally use one-hot encoded representations, with a combination of a CNN module and a capsule network module (MusiteDeep 18 ), three densely connected CNN blocks (DeepPhos 19 ), or a local context module and a global module built from convolutional and LSTM layers, and a squeeze-and-excitation network (DeepPSP 13 ). Finally, LMPhosSite 28 uses a CNN module on a trainable embedding, in combination with a fully-connected neural network built on the pLM representation for the single phosphosite position. For more information, see the Methods section, and their respective publications.…”
Section: Resultsmentioning
confidence: 99%
“…Compared to the LMPhosSite tool that also employs ProtT5-XL-U50 based representations but with a different model, our CNN model showed an improvement of 2.3% to 8.9% in terms of AUPRC, except again on multi-source-Y, where performance was roughly equal. 39 , data from multiple traditional sources (UniProtKB/SwissProt, dbPTM 40 , Phospho.ELM 41 , and PhosphoSitePlus 42 ) used as the benchmark in other phosphosite prediction publications 13,28 , and combined S/T phosphosites from individual experiments with different proteases 1 . Average curves over 10 runs (per dataset) are depicted.…”
Section: Plms Substantially Improve Ptm Site Prediction Accuracymentioning
confidence: 99%
See 2 more Smart Citations
“…An important process while developing a ML/DL model for the prediction of protein O-linked glycosylation sites is representing the primary amino acids with fixed size pertinent feature embeddings via a suitable encoding scheme 48,58 . We employed a pretrained PLM to generate per-residue contextualized embeddings for this investigation.…”
Section: Feature Encodingmentioning
confidence: 99%