As the length of molecular dynamics (MD) trajectories grows with increasing computational power, so does the importance of clustering methods for partitioning trajectories into conformational bins. Of the methods available, the vast majority require users to either have some a priori knowledge about the system to be clustered or to tune clustering parameters through trial and error. Here we present non-parametric uses of two modern clustering techniques suitable for first-pass investigation of an MD trajectory. Being non-parametric, these methods require neither prior knowledge nor parameter tuning. The first method, HDBSCAN, is fast—relative to other popular clustering methods—and is able to group unstructured or intrinsically disordered systems (such as intrinsically disordered proteins, or IDPs) into bins that represent global conformational shifts. HDBSCAN is also useful for determining the overall stability of a system—as it tends to group stable systems into one or two bins—and identifying transition events between metastable states. The second method, iMWK-Means, with explicit rescaling followed by K-Means, while slower than HDBSCAN, performs well with stable, structured systems such as folded proteins and is able to identify higher resolution details such as changes in relative position of secondary structural elements. Used in conjunction, these clustering methods allow a user to discern quickly and without prior knowledge the stability of a simulated system and identify both local and global conformational changes.
Zinc-finger proteins are regulators of critical signaling pathways for various cellular functions, including apoptosis and oncogenesis. Here, we investigate how binding site protonation states and zinc coordination influence protein structure, dynamics, and ultimately function, as these pivotal regulatory proteins are increasingly important for protein engineering and therapeutic discovery. To better understand the thermodynamics and dynamics of the zinc finger of NEMO (NF-κB essential modulator), as well as the role of zinc, we present results of 20 μs molecular dynamics trajectories, 5 μs for each of four active site configurations. Consistent with experimental evidence, the zinc ion is essential for mechanical stabilization of the functional, folded conformation. Hydrogen bond motifs are unique for deprotonated configurations yet overlap in protonated cases. Correlated motions and principal component analysis corroborate the similarity of the protonated configurations and highlight unique relationships of the zinc-bound configuration. We hypothesize a potential mechanism for zinc binding from results of the thiol configurations. The deprotonated, zinc-bound configuration alone predominantly maintains its tertiary structure throughout all 5 μs and alludes rare conformations potentially important for (im)proper zinc-finger-related protein–protein or protein–DNA interactions.
MutSα is a key component in the mismatch repair (MMR) pathway. This protein is responsible for initiating the signaling pathways for DNA repair or cell death. Herein we investigate this heterodimer's post-recognition, post-binding response to three types of DNA damage involving cytotoxic, anti-cancer agents-carboplatin, cisplatin, and FdU. Through a combination of supervised and unsupervised machine learning techniques along with more traditional structural and kinetic analysis applied to all-atom molecular dynamics (MD) calculations, we predict that MutSα has a distinct response to each of the three damage types. Via a binary classification tree (a supervised machine learning technique), we identify key hydrogen bond motifs unique to each type of damage and suggest residues for experimental mutation studies. Through a combination of a recently developed clustering (unsupervised learning) algorithm, RMSF calculations, PCA, and correlated motions we predict that each type of damage causes MutSα to explore a specific region of conformation space. Detailed analysis suggests a short range effect for carboplatin-primarily altering the structures and kinetics of residues within 10 angstroms of the damaged DNA-and distinct longer-range effects for cisplatin and FdU. In our simulations, we also observe that a key phenylalanine residue-known to stack with a mismatched or unmatched bases in MMR-stacks with the base complementary to the damaged base in 88.61% of MD frames containing carboplatinated DNA. Similarly, this Phe71 stacks with the base complementary to damage in 91.73% of frames with cisplatinated DNA. This residue, however, stacks with the damaged base itself in 62.18% of trajectory frames with FdU-substituted DNA and has no stacking interaction at all in 30.72% of these frames. Each drug investigated here induces a unique perturbation in the MutSα complex, indicating the possibility of a distinct signaling event and specific repair or death pathway (or set of pathways) for a given type of damage.
Molecular dynamics (MD) simulation methods have seen significant improvement since their inception in the late 1950s. Constraints of simulation size and duration that once impeded the field have lessened with the advent of better algorithms, faster processors, and parallel computing. With newer techniques and hardware available, MD simulations of more biologically relevant timescales can now sample a broader range of conformational and dynamical changes including rare events. One concern in the literature has been under which circumstances it is sufficient to perform many shorter timescale simulations and under which circumstances fewer longer simulations are necessary. Herein, our simulations of the zinc finger NEMO (2JVX) using multiple simulations of length 15, 30, 1000, and 3000 ns are analyzed to provide clarity on this point.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.