SARS-CoV-2 is the causative agent of COVID-19. The dimeric form of the viral main protease is responsible for the cleavage of the viral polyprotein in 11 sites, including its own N and C-terminus. Although several mechanisms of self-cleavage had been proposed for SARS-CoV, the lack of structural information for each step is a setback to the understanding of this process. Herein, we used X-ray crystallography to characterize an immature form of the main protease, which revealed major conformational changes in the positioning of domain-three over the active site, hampering the dimerization and diminishing its activity. We propose that this form preludes the cis-cleavage of N-terminal residues within the dimer, leading to the mature active site. Using fragment screening, we probe new cavities in this form which can be used to guide therapeutic development. Furthermore, we characterized a serine site-directed mutant of the main protease bound to its endogenous N and C-terminal residues during the formation of the tetramer. This quaternary form is also present in solution, suggesting a transitional state during the C-terminal trans-cleavage. This data sheds light in the structural modifications of the SARS-CoV-2 main protease during maturation, which can guide the development of new inhibitors targeting its intermediary states.
Herein we provide a living summary of the data generated during the COVID Moonshot project focused on the development of SARS-CoV-2 main protease (Mpro) inhibitors. Our approach uniquely combines crowdsourced medicinal chemistry insights with high throughput crystallography, exascale computational chemistry infrastructure for simulations, and machine learning in triaging designs and predicting synthetic routes. This manuscript describes our methodologies leading to both covalent and non-covalent inhibitors displaying protease IC50 values under 150 nM and viral inhibition under 5 uM in multiple different viral replication assays. Furthermore, we provide over 200 crystal structures of fragment-like and lead-like molecules in complex with the main protease. Over 1000 synthesized and ordered compounds are also reported with the corresponding activity in Mpro enzymatic assays using two different experimental setups. The data referenced in this document will be continually updated to reflect the current experimental progress of the COVID Moonshot project, and serves as a citable reference for ensuing publications. All of the generated data is open to other researchers who may find it of use.
The growing quantity of public and private data sets focused on small molecules screened against biological targets or whole organisms provides a wealth of drug discovery relevant data. This is matched by the availability of machine learning algorithms such as Support Vector Machines (SVM) and Deep Neural Networks (DNN) that are computationally expensive to perform on very large data sets with thousands of molecular descriptors. Quantum computer (QC) algorithms have been proposed to offer an approach to accelerate quantum machine learning over classical computer (CC) algorithms, however with significant limitations. In the case of cheminformatics, which is widely used in drug discovery, one of the challenges to overcome is the need for compression of large numbers of molecular descriptors for use on a QC. Here, we show how to achieve compression with data sets using hundreds of molecules (SARS-CoV-2) to hundreds of thousands of molecules (whole cell screening data sets for plague and M. tuberculosis) with SVM and the data reuploading classifier (a DNN equivalent algorithm) on a QC benchmarked against CC and hybrid approaches. This study illustrates the steps needed in order to be “quantum computer ready” in order to apply quantum computing to drug discovery and to provide the foundation on which to build this field.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.