Prediction of multiple conformational states by combining sequence clustering with AlphaFold2

Wayment-Steele, Hannah K.; Овчинников, С. Г.; Colwell, Lucy; Kern, Dorothee

doi:10.1101/2022.10.17.512570

Cited by 97 publications

(148 citation statements)

References 60 publications

Supporting

Mentioning

143

Contrasting

Unclassified

Order By: Relevance

“…Predictions of fold-switchers have moderately reduced pLDDT versus single-fold proteins, but the pLDDTs are still substantially higher than for disordered proteins or disordered regions of specific proteins. Even more strikingly, Wayment-Steele and colleagues have shown that by clustering the sequences used by AlphaFold2, the algorithm can actually predict both folds in specific fold-switching systems 20 . This exciting result suggests that predictions of structural distributions may not be far off, as AlphaFold2 can already produce multiple correct structural outputs for the same protein.…”

Section: A Single Structure Cannot Capture Functional Motionmentioning

confidence: 99%

Protein structure prediction has reached the single-structure frontier

Lane

2023

Nat Methods

View full text Add to dashboard Cite

Dramatic advances in protein structure prediction have sparked debate as to whether the problem of predicting structure from sequence is solved or not. Here, I argue that AlphaFold2 and its peers are currently limited by the fact that they predict only a single structure, instead of a structural distribution, and that this realization is crucial for the next generation of structure prediction algorithms.

show abstract

Section: A Single Structure Cannot Capture Functional Motionmentioning

confidence: 99%

Protein structure prediction has reached the single-structure frontier

Lane

2023

Nat Methods

View full text Add to dashboard Cite

show abstract

“…The model has been made available to the public with DeepMind’s official open-source implementation, which has been used to predict the structures of hundreds of millions of proteins (Tunyasuvunakool et al 2021, Varadi et al 2021, Callaway 2022). This implementation has enabled researchers to optimize AlphaFold2’s prediction procedure and user experience (Mirdita, Schütze, et al 2022) and to employ it as a module within novel algorithms, including ones for protein complex prediction (Baek 2021), peptide-protein interactions (Tsaban et al 2022), structure ranking (Roney and Ovchinnikov 2022), and more ( e.g ., Baltzis et al 2022, Bryant et al 2022, Wayment-Steele et al 2022).…”

Section: Introductionmentioning

confidence: 99%

OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

Ahdritz

Bouatta

Kadyan

et al. 2022

Preprint

131

109

View full text Add to dashboard Cite

AlphaFold2 revolutionized structural biology with the ability to predict protein structures with exceptionally high accuracy. Its implementation, however, lacks the code and data required to train new models. These are necessary to (i) tackle new tasks, like protein-ligand complex structure prediction, (ii) investigate the process by which the model learns, which remains poorly understood, and (iii) assess the model's generalization capacity to unseen regions of fold space. Here we report OpenFold, a fast, memory-efficient, and trainable implementation of AlphaFold2, and OpenProteinSet, the largest public database of protein multiple sequence alignments. We use OpenProteinSet to train OpenFold from scratch, fully matching the accuracy of AlphaFold2. Having established parity, we assess OpenFold's capacity to generalize across fold space by retraining it using carefully designed datasets. We find that OpenFold is remarkably robust at generalizing despite extreme reductions in training set size and diversity, including near-complete elisions of classes of secondary structure elements. By analyzing intermediate structures produced by OpenFold during training, we also gain surprising insights into the manner in which the model learns to fold proteins, discovering that spatial dimensions are learned sequentially. Taken together, our studies demonstrate the power and utility of OpenFold, which we believe will prove to be a crucial new resource for the protein modeling community.

show abstract

“…Recently different works showed results of increased capacity of AF2 to reproduce protein conformational diversity using alignment subsampling (Del Alamo et al, 2022; Wayment-Steele et al, 2022). In an attempt to increase the information of evolutionary trajectories in the input used by AF2, we used ancestral reconstruction prediction (Joy et al, 2016).…”

Section: Resultsmentioning

confidence: 99%

Conformational epistasis impairs AlphaFold structural predictions

Sawicki

Benítez

Carletti

et al. 2022

Preprint

View full text Add to dashboard Cite

Protein structures have been massively predicted using homologous sequence information. AlphaFold2 (AF2) is a recent breakthrough to predict 3D models using machine learning approaches that reached an outstanding accuracy in recent quality evaluations. However, information derived from extant homologous sequences, as those used by AF2, might not contain enough information to accurately predict protein structure. This limitation could be related to the process known as epistasis, which describes the differential effect of a mutation on the evolutionary trajectory. Clear evidence of conformational epistasis, which has a specific impact on protein structure, was characterized in the evolutionary origin of the glucocorticoid receptor (GR) specificity during its functional divergence from the mineralocorticoid (MR) receptor. In this work we explore how AF2 can reproduce conformations derived from epistatic effects. Using structural clustering and principal component analysis to analyze the structural similarities in 16 and 13 extant GR and MR conformers, respectively, we found that AF2 models for human GR failed to reproduce extant GR conformations. Interestingly, AF2 models for human MR, for which no conformational epistasis was reported, were almost indistinguishable from extant MR. Our results showcase the importance of evolutionary trajectories to predict accurate 3D models.

show abstract

Prediction of multiple conformational states by combining sequence clustering with AlphaFold2

Cited by 97 publications

References 60 publications

Protein structure prediction has reached the single-structure frontier

Protein structure prediction has reached the single-structure frontier

OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

Conformational epistasis impairs AlphaFold structural predictions

Contact Info

Product

Resources

About