A unified coarse-grained model of three major classes of biological molecules—proteins, nucleic acids, and polysaccharides—has been developed. It is based on the observations that the repeated units of biopolymers (peptide groups, nucleic acid bases, sugar rings) are highly polar and their charge distributions can be represented crudely as point multipoles. The model is an extension of the united residue (UNRES) coarse-grained model of proteins developed previously in our laboratory. The respective force fields are defined as the potentials of mean force of biomacromolecules immersed in water, where all degrees of freedom not considered in the model have been averaged out. Reducing the representation to one center per polar interaction site leads to the representation of average site–site interactions as mean-field dipole–dipole interactions. Further expansion of the potentials of mean force of biopolymer chains into Kubo’s cluster-cumulant series leads to the appearance of mean-field dipole–dipole interactions, averaged in the context of local interactions within a biopolymer unit. These mean-field interactions account for the formation of regular structures encountered in biomacromolecules, e.g., α-helices and β-sheets in proteins, double helices in nucleic acids, and helicoidally packed structures in polysaccharides, which enables us to use a greatly reduced number of interacting sites without sacrificing the ability to reproduce the correct architecture. This reduction results in an extension of the simulation timescale by more than four orders of magnitude compared to the all-atom representation. Examples of the performance of the model are presented.FigureComponents of the Unified Coarse Grained Model (UCGM) of biological macromolecules
A general and systematic method for the derivation of the functional expressions for the effective energy terms in coarse-grained force fields of polymer chains is proposed. The method is based on the expansion of the potential of mean force of the system studied in the cluster-cumulant series and expanding the all-atom energy in the Taylor series in the squares of interatomic distances about the squares of the distances between coarse-grained centers, to obtain approximate analytical expressions for the cluster cumulants. The primary degrees of freedom to average about are the angles for collective rotation of the atoms contained in the coarse-grained interaction sites about the respective virtual-bond axes. The approach has been applied to the revision of the virtual-bond-angle, virtual-bond-torsional, and backbone-local-and-electrostatic correlation potentials for the UNited RESidue (UNRES) model of polypeptide chains, demonstrating the strong dependence of the torsional and correlation potentials on virtual-bond angles, not considered in the current UNRES. The theoretical considerations are illustrated with the potentials calculated from the ab initiopotential-energysurface of terminally blocked alanine by numerical integration and with the statistical potentials derived from known protein structures. The revised torsional potentials correctly indicate that virtual-bond angles close to 90° result in the preference for the turn and helical structures, while large virtual-bond angles result in the preference for polyproline II and extended backbone geometry. The revised correlation potentials correctly reproduce the preference for the formation of β-sheet structures for large values of virtual-bond angles and for the formation of α-helical structures for virtual-bond angles close to 90°.
The potentials of mean force (PMFs) were determined, in both water with the TIP3P water model and in vacuo, for systems involving formation of nonpolar dimers composed of bicyclooctane, adamantane (both an all-atom model and a sphere with the radius of 3.4 Å representing adamantane), and fullerene, respectively. A series of umbrella-sampling molecular dynamics simulations with the AMBER force field were carried out for each pair, under both environmental conditions. The PMFs were calculated by using the Weighted Histogram Analysis Method (WHAM). The results were compared with our previously-determined PMF for neopentane. The shape of the PMFs for dimers of all four nonpolar molecules is characteristic of hydrophobic interactions with contact and solvent-separated minima, and desolvation maxima. The positions of all these minima and maxima change with the size of the nonpolar molecule; for larger molecules they shift toward larger distances. Comparison of the PMFs of the bicyclooctane, adamantane, and fullerene dimers in water and in vacuo shows that hydrophobic interactions in each dimer are different from that for the dimer of neopentane. Interactions in the bicyclooctane, adamantane, and fullerene dimers are stronger in vacuo than in water. These dimers cannot be treated as classical spherical hydrophobic objects. The solvent contribution to the PMF was also computed by subtracting the PMF determined in vacuo from that in explicit solvent. The solvent contribution to the PMFs of bicyclooctane, adamantane, and fullerene is positive as opposed to that of neopentane. The water molecules in the first solvation sphere of both adamantane and neopentane dimers are more ordered compared to bulk water, with their dipole moments pointing away from the surface of the dimers. The average number of hydrogen bonds per water molecule in the first hydration shell of adamantane is smaller compared to that in bulk water, but this shell is thicker for all-atom adamantane than for neopentane or a spherical model of adamantane. In the second hydration shell, the average number of hydrogen bonds is greater compared to that in bulk water only for neopentane and a spherical model of adamantane but not for the all-atom model. The strength of the hydrophobic interactions shows a linear dependence on the number of carbon atoms both in water and in vacuo. Smaller nonpolar particles interact more strongly in water than in vacuo. For larger molecules such as bicyclooctane, adamanatane and fullerene, the reversed tendency is observed.
Recent improvements in the protein-structure prediction method developed in our laboratory, based on the thermodynamic hypothesis, are described. The conformational space is searched extensively at the united-residue level by using our physics-based UNRES energy function and the conformational space annealing method of global optimization. The lowest-energy coarse-grained structures are then converted to an all-atom representation and energyminimized with the ECEPP͞3 force field. The procedure was assessed in two recent blind tests of protein-structure prediction. During the first blind test, we predicted large fragments of ␣ and ␣؉ proteins [60 -70 residues with C ␣ rms deviation (rmsd) <6 Å]. However, for ␣؉ proteins, significant topological errors occurred despite low rmsd values. In the second exercise, we predicted whole structures of five proteins (two ␣ and three ␣؉, with sizes of 53-235 residues) with remarkably good accuracy. In particular, for the genomic target TM0487 (a 102-residue ␣؉ protein from Thermotoga maritima), we predicted the complete, topologically correct structure with 7.3-Å C ␣ rmsd. So far this protein is the largest ␣؉ protein predicted based solely on the amino acid sequence and a physics-based potential-energy function and search procedure. For target T0198, a phosphate transport system regulator PhoU from T. maritima (a 235-residue mainly ␣-helical protein), we predicted the topology of the whole six-helix bundle correctly within 8 Å rmsd, except the 32 C-terminal residues, most of which form a -hairpin. These and other examples described in this work demonstrate significant progress in physics-based protein-structure prediction.global optimization ͉ thermodynamic hypothesis T o date, the great majority of successful algorithms for proteinstructure prediction are knowledge-based approaches; they make explicit use of homology modeling (1, 2) or fold recognition methods (2-6). This feature even pertains to most of the methods considered as ab initio (7,8), which, in theory, should not make explicit use of structural databases. However, in-depth understanding of the physical principles of formation of protein structure requires the development of physics-based methods for proteinstructure prediction (9). Moreover, such methods will be independent of structural databases used in the training of knowledgebased methods. Furthermore, physics-based methods will enable us to study the structures of proteins that seem to possess a degenerate native state, such as the prion proteins, to simulate protein-folding pathways, to understand the mechanisms of protein folding, and to study interactions of proteins with other biomacromolecules and their assemblies (e.g., nucleic acids, polysaccharides, lipids, etc.). The underlying principle of physics-based methods for proteinstructure prediction is Anfinsen's thermodynamic hypothesis (10), according to which protein molecules adopt the conformations that are the global minima of their potential-energy surfaces. The methods based on this hypoth...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.