We
present a supercomputer-driven pipeline for in silico drug discovery
using enhanced sampling molecular dynamics (MD) and ensemble docking.
Ensemble docking makes use of MD results by docking compound databases
into representative protein binding-site conformations, thus taking
into account the dynamic properties of the binding sites. We also
describe preliminary results obtained for 24 systems involving eight
proteins of the proteome of SARS-CoV-2. The MD involves temperature
replica exchange enhanced sampling, making use of massively parallel
supercomputing to quickly sample the configurational space of protein
drug targets. Using the Summit supercomputer at the Oak Ridge National
Laboratory, more than 1 ms of enhanced sampling MD can be generated
per day. We have ensemble docked repurposing databases to 10 configurations
of each of the 24 SARS-CoV-2 systems using AutoDock Vina. Comparison
to experiment demonstrates remarkably high hit rates for the top scoring
tranches of compounds identified by our ensemble approach. We also
demonstrate that, using Autodock-GPU on Summit, it is possible to
perform exhaustive docking of one billion compounds in under 24 h.
Finally, we discuss preliminary results and planned improvements to
the pipeline, including the use of quantum mechanical (QM), machine
learning, and artificial intelligence (AI) methods to cluster MD trajectories
and rescore docking poses.
We use quantum detector tomography to characterize the qubit readout in terms of measurement POVMs on IBM Quantum Computers IBM Q 5 Tenerife and IBM Q 5 Yorktown. Our results suggest that the characterized detector model deviates from the ideal projectors by a few percent. Further improvement on this characterization can be made by adopting two-or more-qubit detector models instead of independent single-qubit detectors for all the qubits in one device. An unexpected behavior was seen in the physical qubit labelled as qubit 3 of IBM Q 5 Tenerife, which can be a consequence of detector crosstalk or qubit operations influencing each other and requires further investigation. This peculiar behavior is consistent with characterization from the more sophisticated approach of the gate set tomography. We also discuss how the characterized detectors' POVM, despite deviation from the ideal projectors, can be used to estimate the ideal detection distribution.
X-ray absorption spectroscopy is a premier, element-specific technique for materials characterization. Specifically, the x-ray absorption near edge structure (XANES) encodes important information about the local chemical environment of an absorbing atom, including coordination number, symmetry and oxidation state. Interpreting XANES spectra is a key step towards understanding the structural and electronic properties of materials, and as such, extracting structural and electronic descriptors from XANES spectra is akin to solving a challenging inverse problem. Existing methods rely on empirical fingerprints, which are often qualitative or semi-quantitative and not transferable. In this study, we present a machine learning-based approach, which is capable of classifying the local coordination environments of the absorbing atom from simulated K-edge XANES spectra. The machine learning classifiers can learn important spectral features in a broad energy range without human bias, and once trained, can make predictions on the fly. The robustness and fidelity of the machine learning method are demonstrated by an average 86% accuracy across the wide chemical space of oxides in eight 3d transition metal families. We found that spectral features beyond the pre-edge region play an important role in the local structure classification problem, especially for the late 3d transition metal elements.
Simulations of excited state properties, such as spectral functions, are often computationally expensive and therefore not suitable for high-throughput modeling. As a proof of principle, we demonstrate that graph-based neural networks can be used to predict the x-ray absorption nearedge structure spectra of molecules to quantitative accuracy. Specifically, the predicted spectra reproduce nearly all prominent peaks, with 90% of the predicted peak locations within 1 eV of the ground truth. Besides its own utility in spectral analysis and structure inference, our method can be combined with structure search algorithms to enable high-throughput spectrum sampling of the vast material configuration space, which opens up new pathways to material design and discovery.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.