Motivation:With the breakthrough of AlphaFold2 and the publication of AlphaFold DB, the protein structure prediction has made remarkable progress, which may further promote many potential applications of proteomics in all areas of life. However, it should be noted that AlphaFold2 models tend to represent only a single static structure, and accurately predicting multiple conformations remains a challenge. Therefore, it is essential to develop methods for predicting multiple conformations, which enable us to gain knowledge of multiple conformational states and the broader conformational landscape to better understand the mechanism of action.
Results:In this work, we proposed a multiple conformational states folding method using the distance-based multi-objective evolutionary algorithm framework, named MultiSFold. First, a multi-objective energy landscape with multiple competing constraints generated by deep learning is constructed. Then, an iterative modal exploration and exploitation strategy based on multi-objective optimization, geometric optimization and structural similarity clustering is designed to perform conformational sampling. Finally, the final population is generated using a loop-specific perturbation strategy to adjust the spatial orientations. MultiSFold was compared with state-of-the-art methods on a developed benchmark testset containing 81 proteins with two representative conformational states. Based on the proposed metric, the success ratio of MultiSFold predicting multiple conformations was 70.4% while that of AlphaFold2 was 9.88%, which may indicate that conformational sampling combined with knowledge gained through deep learning has the potential to produce conformations spanned the range between two experimental structures. In addition, MultiSFold was tested on 244 human proteins with low structural accuracy in AlphaFold DB to test whether it could further improve the accuracy of static structures. The experimental results demonstrate that the TM-score of MultiSFold is 2.97% and 7.72% higher than that of AlphaFold2 and RoseTTAFold, respectively, supporting our hypothesis that multiple competing optimization objectives can further assist conformational search to improve prediction accuracy.