2022
DOI: 10.1101/2022.07.06.499043
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

An interpretable machine learning algorithm to predict disordered protein phase separation based on biophysical interactions

Abstract: Protein phase separation is increasingly understood to be an important mechanism of biological organization and biomaterial formation. Intrinsically disordered protein regions (IDRs) are often significant drivers of protein phase separation. A number of protein phase separation prediction algorithms are available, with many specific for particular classes of proteins and others providing results that are not amenable to interpretation of contributing biophysical interactions. Here we describe LLPhyScore, a new… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 74 publications
0
6
0
Order By: Relevance
“…We used this whole protein metric (the summed classifier distance of P-labeled windows) to create a recall plot, used to assess prediction performance, for multiple datasets (Figures 4C, S5). The success in recall plots is typically quantified using the area under the curve (AUC), when comparing a test dataset to a comparison dataset (47, 76, 77). Here, in all cases, we used the human proteome as the comparison dataset.…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…We used this whole protein metric (the summed classifier distance of P-labeled windows) to create a recall plot, used to assess prediction performance, for multiple datasets (Figures 4C, S5). The success in recall plots is typically quantified using the area under the curve (AUC), when comparing a test dataset to a comparison dataset (47, 76, 77). Here, in all cases, we used the human proteome as the comparison dataset.…”
Section: Resultsmentioning
confidence: 99%
“…On their datasets, ParSe performs similarly as measured by AUC scores, to PScore (16), CatGranule (34), and PLAAC (32) in identifying proteins that drive LLPS (Figure S9A-C). The quality of the test one can make of these predictors depends significantly on the quality of the datasets, and so a true test of these predictors will require significantly more experimental data from both positive and negative controls (31, 47).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations