2022
DOI: 10.1101/2022.12.06.519132
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Mega-scale experimental analysis of protein folding stability in biology and protein design

Abstract: Advances in DNA sequencing and machine learning are illuminating protein sequences and structures on an enormous scale. However, the energetics driving folding are invisible in these structures and remain largely unknown. The hidden thermodynamics of folding can drive disease, shape protein evolution, and guide protein engineering, and new approaches are needed to reveal these thermodynamics for every sequence and structure. We present cDNA display proteolysis, a new method for measuring thermodynamic folding … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

5
143
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 62 publications
(148 citation statements)
references
References 124 publications
5
143
0
Order By: Relevance
“…The performance of the model on MegaTest set is shown in Table 1. The PCC was 0.85 which approaches the agreement to PCC = 0.91 between trypsin-and chymotrypsin-based data from the Mega dataset (Tsuboyama et al, 2022). For the MegaValidation dataset ABYSSAL showed the same PCC = 0.85, see upper panel of Fig.…”
Section: Performance Of Abyssalsupporting
confidence: 72%
See 4 more Smart Citations
“…The performance of the model on MegaTest set is shown in Table 1. The PCC was 0.85 which approaches the agreement to PCC = 0.91 between trypsin-and chymotrypsin-based data from the Mega dataset (Tsuboyama et al, 2022). For the MegaValidation dataset ABYSSAL showed the same PCC = 0.85, see upper panel of Fig.…”
Section: Performance Of Abyssalsupporting
confidence: 72%
“…We took the experimental data on protein stability changes (∆∆G) upon mutations from (Tsuboyama et al, 2022) where ∆∆G values were estimated from cleavages by proteases. We downloaded the file "K50_dG_Dataset1_Dataset2.csv" from the Zenodo repository https://zenodo.org/record/7401275#.Y6st59JBxD_ associated with the paper (Tsuboyama et al, 2022). Out of 851,552 stability change data for 542 reference proteins, we removed records (i) having tag 'unreliable', (ii) when no mutation was introduced, (iii) associated with insertions and/or deletions, (iv) associated with multiple mutations.…”
Section: Datasetmentioning
confidence: 99%
See 3 more Smart Citations