Cryogenic Electron Microscopy (cryo-EM) preserves the ensemble of protein conformations in solution and thus provide a promising way to characterize conformational changes underlying protein functions. However, it remains challenging for existing software to elucidate distributions of multiple conformations from a heterogeneous cryo-EM dataset. We developed a new algorithm: Linear Combinations of Template Conformations (LCTC) to obtain distributions of multiple conformations from cryo-EM datasets. LCTC assigns 2D images to the template 3D structures obtained by Multi-body Refinement of RELION via a novel two-stage matching algorithm. Specifically, an initial rapid assignment of experimental 2D images to template 2D images was applied based on auto-correlation functions of image contours that can efficiently remove the majority of irrelevant 2D images. This is followed by pixel-pixel matching of images with fewer number of 2D images, which can accurately assign the 2D images to the template images. We validate the LCTC method by demonstrating that it can accurately reproduce the distributions of 3 Thermus aquaticus (Taq) RNA polymerase (RNAP) structures with different degrees of clamp opening from a simulated cryo-EM dataset, in which the correct distributions are known. For this dataset, we also show that LCTC greatly outperforms clustering-based Manifold Embedding and Maximum Likelihood-based Multi-body Refinement algorithms in terms of reproducing the structural distributions. Lastly, we also successfully applied LCTC to reveal the populations of various clamp-opening conformations from an experimental Escherichia coli RNAP cryo-EM dataset. Source code is available at https://github.com/ghl1995/LCTC.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.