2018
DOI: 10.1093/bioinformatics/bty880
|View full text |Cite
|
Sign up to set email alerts
|

A natural upper bound to the accuracy of predicting protein stability changes upon mutations

Abstract: Accurate prediction of protein stability changes upon single-site variations (G) is important for protein design, as well as our understanding of the mechanism of genetic diseases. The performance of high-throughput computational methods to this end is evaluated mostly based on the Pearson correlation coefficient between predicted and observed data, assuming that the upper bound would be 1 (perfect correlation). However, the performance of these predictors can be limited by the distribution and noise of the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

2
58
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 40 publications
(60 citation statements)
references
References 33 publications
2
58
0
Order By: Relevance
“…For example, a number of studies have compared ΔΔG predictors, showing heterogeneous correlations with experimental values on the order of R = 0.5 for many predictors 12 , 13 , 65 . However, a recent work has also revealed problems with the noise in experimental stability data used to benchmark the prediction methods, generally assessed through correlation values 66 . Taking noise and data distribution limitations into account, it is estimated that with currently available experimental data the best ΔΔG predictor output correlations should be in the range 0.7–0.8, while higher values would suggest overfitting 66 .…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, a number of studies have compared ΔΔG predictors, showing heterogeneous correlations with experimental values on the order of R = 0.5 for many predictors 12 , 13 , 65 . However, a recent work has also revealed problems with the noise in experimental stability data used to benchmark the prediction methods, generally assessed through correlation values 66 . Taking noise and data distribution limitations into account, it is estimated that with currently available experimental data the best ΔΔG predictor output correlations should be in the range 0.7–0.8, while higher values would suggest overfitting 66 .…”
Section: Discussionmentioning
confidence: 99%
“…However, a recent work has also revealed problems with the noise in experimental stability data used to benchmark the prediction methods, generally assessed through correlation values 66 . Taking noise and data distribution limitations into account, it is estimated that with currently available experimental data the best ΔΔG predictor output correlations should be in the range 0.7–0.8, while higher values would suggest overfitting 66 . As such, even assuming that ‘true’ ΔΔG values were perfectly correlated with mutation pathogenicity, we would still expect these computational predictors to misclassify many variants.…”
Section: Discussionmentioning
confidence: 99%
“…For example, a number of studies have compared ΔΔG predictors, showing heterogeneous correlations with experimental values on the order of R=0.5 for many predictors 12,13,60 . However, a recent work has also revealed problems with the noise in experimental stability data used to benchmark the prediction methods, generally assessed through correlation values 61 . Taking noise and data distribution limitations into account, it is estimated that with currently available experimental data the best ΔΔG predictor output correlations should be in the range 0.7-0.8, while higher values would suggest overfitting 61 .…”
Section: Discussionmentioning
confidence: 99%
“…However, a recent work has also revealed problems with the noise in experimental stability data used to benchmark the prediction methods, generally assessed through correlation values 61 . Taking noise and data distribution limitations into account, it is estimated that with currently available experimental data the best ΔΔG predictor output correlations should be in the range 0.7-0.8, while higher values would suggest overfitting 61 . As such, even assuming that 'true' ΔΔG values were perfectly correlated with mutation pathogenicity, we would still expect these computational predictors to misclassify many variants.…”
Section: Discussionmentioning
confidence: 99%
“…This result is consistent with a recent theoretical estimate of a natural upper bound of the accuracy of DDG predictions. 50 These results suggest that there may still be room for improvement of computational DDG predictions.…”
Section: Prediction Of Conformational Stabilitymentioning
confidence: 97%