SARS-CoV-2 is the causative agent of COVID-19. The dimeric form of the viral main protease is responsible for the cleavage of the viral polyprotein in 11 sites, including its own N and C-terminus. Although several mechanisms of self-cleavage had been proposed for SARS-CoV, the lack of structural information for each step is a setback to the understanding of this process. Herein, we used X-ray crystallography to characterize an immature form of the main protease, which revealed major conformational changes in the positioning of domain-three over the active site, hampering the dimerization and diminishing its activity. We propose that this form preludes the cis-cleavage of N-terminal residues within the dimer, leading to the mature active site. Using fragment screening, we probe new cavities in this form which can be used to guide therapeutic development. Furthermore, we characterized a serine site-directed mutant of the main protease bound to its endogenous N and C-terminal residues during the formation of the tetramer. This quaternary form is also present in solution, suggesting a transitional state during the C-terminal trans-cleavage. This data sheds light in the structural modifications of the SARS-CoV-2 main protease during maturation, which can guide the development of new inhibitors targeting its intermediary states.
Herein we provide a living summary of the data generated during the COVID Moonshot project focused on the development of SARS-CoV-2 main protease (Mpro) inhibitors. Our approach uniquely combines crowdsourced medicinal chemistry insights with high throughput crystallography, exascale computational chemistry infrastructure for simulations, and machine learning in triaging designs and predicting synthetic routes. This manuscript describes our methodologies leading to both covalent and non-covalent inhibitors displaying protease IC50 values under 150 nM and viral inhibition under 5 uM in multiple different viral replication assays. Furthermore, we provide over 200 crystal structures of fragment-like and lead-like molecules in complex with the main protease. Over 1000 synthesized and ordered compounds are also reported with the corresponding activity in Mpro enzymatic assays using two different experimental setups. The data referenced in this document will be continually updated to reflect the current experimental progress of the COVID Moonshot project, and serves as a citable reference for ensuing publications. All of the generated data is open to other researchers who may find it of use.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.