Accurate structural validation of proteins is essential in studies such as protein structure prediction, analysis of molecular dynamics simulation trajectories, and detection of subtle changes in very similar structures. The current benchmarks for structure validation are scoring methods such as the global distance test-total score (GDT-TS), the TM-score, and the root mean square deviation (RMSD). However, there is a lack of methods that examine both the protein backbone and the side-chain structures at the level of global connectivity and report where the connectivity differs. To address this gap, a graph-spectral method (the network similarity score, NSS), recently developed to rigorously compare networks in diverse fields, is adopted to compare protein structures at both the backbone and the side-chain noncovalent connectivity levels. In this study, we validate the performance of NSS on protein structures obtained from X-ray crystallography, modeling (including CASP models), and molecular dynamics simulations. Further, we systematically identify the local and global regions of the structures that contribute to the difference in NSS through the components of the score, a feature unique to this spectral scoring scheme. We demonstrate that the method can quantify subtle differences in connectivity relative to a reference protein structure and can form a robust basis for protein structure comparison. Additionally, we introduce a network-based method to analyze fluctuations in side-chain interactions (edge weights) across an ensemble of structures, which can be a useful tool for the analysis of MD trajectories.
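The idea of tracking edge-weight fluctuations across an ensemble can be illustrated with a minimal sketch. It assumes each snapshot is already available as a mapping from residue pairs to side-chain interaction weights, and it uses the population standard deviation as the fluctuation measure. The function name, the toy data, and the choice of statistic are illustrative assumptions, not the method described in the abstract.

```python
from statistics import mean, pstdev

def edge_weight_fluctuations(ensemble):
    """Return {edge: (mean weight, standard deviation)} over all snapshots.

    `ensemble` is a list of dicts, one per snapshot, mapping a residue
    pair (i, j) to its edge weight. An edge missing from a snapshot is
    treated as having weight 0.0 there (an assumption of this sketch).
    """
    # Collect every edge that appears in at least one snapshot.
    edges = set()
    for snapshot in ensemble:
        edges.update(snapshot)

    stats = {}
    for edge in sorted(edges):
        weights = [snapshot.get(edge, 0.0) for snapshot in ensemble]
        stats[edge] = (mean(weights), pstdev(weights))
    return stats

# Hypothetical three-snapshot ensemble: edge (1, 2) is stable,
# edge (2, 3) fluctuates and is absent from the last snapshot.
ensemble = [
    {(1, 2): 4.0, (2, 3): 1.0},
    {(1, 2): 4.0, (2, 3): 3.0},
    {(1, 2): 4.0},
]
fluct = edge_weight_fluctuations(ensemble)
```

Edges with a large standard deviation relative to their mean are the ones whose side-chain contacts vary most across the ensemble.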
In this perspective article, we present a multidisciplinary approach for characterizing protein structure networks. We first place our approach in its historical context and describe how it synthesizes concepts from quantum chemistry, the biology of polymer conformations, matrix mathematics, and percolation theory. We then explicitly provide the method for constructing the protein structure network in terms of non-covalently interacting amino acid side chains and show how a wealth of information can be obtained from the graph spectra of these networks. Suitable mathematical approaches, such as the use of a weighted Laplacian matrix to generate the spectra, enable us to develop rigorous methods for network comparison and to identify, through a perturbation approach, the nodes that are crucial for network integrity. Our scoring methods have several applications in structural biology that are inaccessible to conventional methods of analysis. Here, we discuss three instances: (a) protein structure comparison that includes the details of side-chain connectivity; (b) the contribution to node clustering as a function of bound ligand, which explains the global effect of local changes in phenomena such as allostery; and (c) the identification of amino acids crucial for structural integrity, derived purely from the spectra of the graph. We demonstrate how our method yields valuable information on key proteins involved in cellular functions and diseases, such as GPCRs and HIV protease, and discuss the biological implications. We then briefly describe how concepts from percolation theory further augment our analyses. In our concluding perspective on future developments, we suggest a further unifying approach to protein structure analysis and a judicious choice of questions for applying our methods to larger, more complex networks, such as metabolic and disease networks.
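As a rough illustration of the weighted Laplacian spectrum and the perturbation idea mentioned above, the following sketch builds L = D - A from a symmetric weighted adjacency matrix, computes its eigenvalues with NumPy, and scores each node by how much the spectrum shifts when that node's edges are removed. The function names, the toy matrix, and the use of a Euclidean distance between sorted spectra are assumptions made for illustration; the article's actual scoring scheme is not reproduced here.

```python
import numpy as np

def weighted_laplacian(adj):
    """Weighted graph Laplacian L = D - A for a symmetric matrix A."""
    adj = np.asarray(adj, dtype=float)
    return np.diag(adj.sum(axis=1)) - adj

def spectrum(adj):
    """Eigenvalues of the weighted Laplacian, in ascending order."""
    return np.linalg.eigvalsh(weighted_laplacian(adj))

def node_perturbation(adj):
    """Spectral shift caused by deleting each node's edges in turn.

    The shift is the Euclidean norm of the difference between the
    perturbed and unperturbed sorted spectra (an illustrative choice).
    """
    adj = np.asarray(adj, dtype=float)
    base = spectrum(adj)
    scores = []
    for k in range(adj.shape[0]):
        perturbed = adj.copy()
        perturbed[k, :] = 0.0  # remove all edges of node k
        perturbed[:, k] = 0.0
        scores.append(float(np.linalg.norm(spectrum(perturbed) - base)))
    return scores

# Hypothetical 3-residue chain: node 1 bridges nodes 0 and 2.
adj = [
    [0.0, 2.0, 0.0],
    [2.0, 0.0, 1.0],
    [0.0, 1.0, 0.0],
]
eigenvalues = spectrum(adj)
scores = node_perturbation(adj)
most_critical = scores.index(max(scores))
```

In this toy chain the bridging node produces the largest spectral shift, because removing its edges disconnects the whole network.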