2024
DOI: 10.1101/2024.01.31.578102
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SPIRED-Fitness: an end-to-end framework for the prediction of protein structure and fitness from single sequence

Yinghui Chen,
Yunxin Xu,
Di Liu
et al.

Abstract: Significant research progress has been made in the field of protein structure and fitness prediction. Particularly, single-sequence-based structure prediction methods like ESMFold and OmegaFold achieve a balance between inference speed and prediction accuracy, showing promise for many downstream prediction tasks. Here, we propose SPIRED, a novel single-sequence-based structure prediction model that exhibits comparable performance to the state-of-the-art methods but with approximately 5-fold acceleration in inf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 69 publications
0
3
0
Order By: Relevance
“…Hence, the non-statistical-potential-style loss function in GDFold2 successfully complements the 2D geometric prediction results by statistics derived from high-resolution protein structures, nearly without causing information loss. More GDFold2 folding cases could be found in the Supplementary Materials of our SPIRED paper 21 . Unlike SPIRED, Cerebra prediction results involve full information required for all-atom structural reconstruction.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Hence, the non-statistical-potential-style loss function in GDFold2 successfully complements the 2D geometric prediction results by statistics derived from high-resolution protein structures, nearly without causing information loss. More GDFold2 folding cases could be found in the Supplementary Materials of our SPIRED paper 21 . Unlike SPIRED, Cerebra prediction results involve full information required for all-atom structural reconstruction.…”
Section: Resultsmentioning
confidence: 99%
“…We employed the highly efficient GDFold2 pipeline to generate a large amount of decoy structures based on randomly masked SPIRED and Cerebra predictions for a number of proteins in their training sets (see Methods ) and used these decoys to train the QA model. Consequently, to avoid information leaking, during the evaluation of our QA model, we chose proteins in the CAMEO testing dateset 21 that are absent in the SPIRED/Cerebra training sets, and selected RoseTTAFold predictions as constraints to generate 24 structural models by GDFold2 for each protein target. Generally, the structural models produced for each individual protein target present certain degree of structural dissimilarity, with an average deviation ( i .…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation