2022
DOI: 10.1101/2022.11.28.518163
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A study on experimental bias in post-translational modification predictors

Abstract: Motivation: With a regulatory impact on numerous biological processes, protein phosphorylation is one of the most studied post-translational modifications. Effective computational methods that provide a sequence-based prediction of probable phosphorylation sites are desirable to guide functional experiments or constrain search spaces for proteomics-based experimental pipelines. Currently, the most successful methods train deep learning models on amino acid composition representations. However, recently propose… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
13
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(13 citation statements)
references
References 59 publications
0
13
0
Order By: Relevance
“…Therefore, the prediction of protein phosphosites is instrumental in elucidating possible regulatory phosphosites where extensive experimental phosphoproteomic data is not available (Meng et al, 2022). While the use of state-of-the-art deep learning and pLMs for protein phosphorylation prediction improved performance over existing methods (Zuallaert et al, 2022), classical machine learning approaches, such as gradient boosting trees, remain as powerful and scalable classification methods (Anghel et al, 2019;Lyashevska et al, 2021). Furthermore, ensemble methods that combine different machine-learning classification methods can improve classification performance by learning from multiple weak classifiers (Dietterich, 1997).…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Therefore, the prediction of protein phosphosites is instrumental in elucidating possible regulatory phosphosites where extensive experimental phosphoproteomic data is not available (Meng et al, 2022). While the use of state-of-the-art deep learning and pLMs for protein phosphorylation prediction improved performance over existing methods (Zuallaert et al, 2022), classical machine learning approaches, such as gradient boosting trees, remain as powerful and scalable classification methods (Anghel et al, 2019;Lyashevska et al, 2021). Furthermore, ensemble methods that combine different machine-learning classification methods can improve classification performance by learning from multiple weak classifiers (Dietterich, 1997).…”
Section: Discussionmentioning
confidence: 99%
“…The PhosBoost achieved a higher recall score at all probability thresholds (Figure 1h). To provide additional support for using the PhosBoost stacking classifier approach, we conducted a similar analysis on an independent dataset, the Ramasamy22 protein phosphorylation dataset, obtained from the PhosphoLingo preprint (Zuallaert et al, 2022).…”
Section: Assessing the Performance Of The Stacking Classifiermentioning
confidence: 99%
See 3 more Smart Citations