A description of many biological processes requires knowledge of the 3-D structure of proteins and, in particular, the defined active site responsible for biological function. Many proteins, the genes of which have been identified as the result of human genome sequencing, and which were synthesized experimentally, await identification of their biological activity. Currently used methods do not always yield satisfactory results, and new algorithms need to be developed to recognize the localization of active sites in proteins. This paper describes a computational model that can be used to identify potential areas that are able to interact with other molecules (ligands, substrates, inhibitors, etc.). The model for active site recognition is based on the analysis of hydrophobicity distribution in protein molecules. It is shown, based on the analyses of proteins with known biological activity and of proteins of unknown function, that the region of significantly irregular hydrophobicity distribution in proteins appears to be function related.
This paper introduces a new model that enables researchers to conduct protein folding simulations. A two-step in silico process is used in the course of structural analysis of a set of fast-folding proteins. The model assumes an early stage (ES) that depends solely on the backbone conformation, as described by its geometrical properties--specifically, by the V-angle between two sequential peptide bond planes (which determines the radius of curvature, also called R-radius, according to a second-degree polynomial form). The agreement between the structure under consideration and the assumed model is measured in terms of the magnitude of dispersion of both parameters with respect to idealized values. The second step, called late-stage folding (LS), is based on the "fuzzy oil drop" model, which involves an external hydrophobic force field described by a three-dimensional Gauss function. The degree of conformance between the structure under consideration and its idealized model is expressed quantitatively by means of the Kullback-Leibler entropy, which is a measure of disparity between the observed and expected hydrophobicity distributions. A set of proteins, representative of the fast-folding group - specifically, cold shock proteins - is shown to agree with the proposed model.
The comparison of eight tools applicable to ligand-binding site prediction is presented. The methods examined cover three types of approaches: the geometrical (CASTp, PASS, Pocket-Finder), the physicochemical (Q-SiteFinder, FOD) and the knowledge-based (ConSurf, SuMo, WebFEATURE). The accuracy of predictions was measured in reference to the catalytic residues documented in the Catalytic Site Atlas. The test was performed on a set comprising selected chains of hydrolases. The results were analysed with regard to size, polarity, secondary structure, accessible solvent area of predicted sites as well as parameters commonly used in machine learning (F-measure, MCC). The relative accuracies of predictions are presented in the ROC space, allowing determination of the optimal methods by means of the ROC convex hull. Additionally the minimum expected cost analysis was performed. Both advantages and disadvantages of the eight methods are presented. Characterization of protein chains in respect to the level of difficulty in the active site prediction is introduced. The main reasons for failures are discussed. Overall, the best performance offers SuMo followed by FOD, while Pocket-Finder is the best method among the geometrical approaches.Electronic supplementary materialThe online version of this article (doi:10.1007/s10822-010-9402-0) contains supplementary material, which is available to authorized users.
Mutations in proteins introduce structural changes and influence biological activity: the specific effects depend on the location of the mutation. The simple method proposed in the present paper is based on a two-step model of in silico protein folding. The structure of the first intermediate is assumed to be determined solely by backbone conformation. The structure of the second one is assumed to be determined by the presence of a hydrophobic center. The comparable structural analysis of the set of mutants is performed to identify the mutant-induced structural changes. The changes of the hydrophobic core organization measured by the divergence entropy allows quantitative comparison estimating the relative structural changes upon mutation. The set of antifreeze proteins, which appeared to represent the hydrophobic core structure accordant with “fuzzy oil drop” model was selected for analysis.
To cite this version:Katarzyna Prymula, Kinga Salapa, Irena Roterman. "Fuzzy oil drop" model applied to individual small proteins built of 70 amino acids. Journal of Molecular Modeling, Springer Verlag (Germany), 2010, 16 (7) Abstract: The proteins composed of short polypeptides (about 70 amino acid residues) representing the following functional groups (according to PDB notation): growth hormones, serine protease inhibitors, antifreeze proteins, chaperones and proteins of unknown function, were selected for structural and functional analysis. Classification based on the distribution of hydrophobicity in terms of deficiency/excess as the measure of structural and functional specificity is presented. The experimentally observed distribution of hydrophobicity in the protein body is compared to the idealized one expressed by a three-dimensional Gauss function. The differences between these two distributions reveal the specificity of structural/functional characteristics of the protein. The residues of hydrophobicity deficiency versus the idealized distribution are assumed to indicate cavities with the potential to bind ligands, while the residues of hydrophobicity excess are interpreted as potentially participating in protein-protein complexation. The distribution of hydrophobicity irregularity seems to be specific for particular structures and functions of proteins. A comparative analysis of such profiles is carried out to identify the potential biological activity of proteins of unknown function.Response to Reviewers: Comments to the reviewers Reviewer #1The sentence "protein structure determines its biological function" got changed to the form : "The spatial distribution of amino acid residues and particularly distribution of their specific hydrophobicity in a protein structure is assumed to influence the biological function". Reviewer #2Ad.1. The section INTRODUCTION has been modified making the problem of NBP less exposed. The paper describing the structures of NBP form is in press currently. The appearance of that paper will make clear the idea of NBP and the usefulness of the presented method. The real proteins available in PDB allowed the comparative analysis of the NBP proteins reveling some substantial differences and similarities between these two groups of proteins position #18 on the reference list (Prymula K, Piwowar M, Kochanczyk M, Flis L, Malawski M, Szepieniec T, Evangelista G, Minervini G, Polticelli F, Wisniowski Z, Salapa K, Matczynska E, and Roterman I (2009) In silico structural study of random amino acid sequence proteins not present in nature. Chem Biodivers, in press). Ad.2. The hydrophobicity scaleThe hydrophobicity scales (theoretical and experimental) were compared in details in respect to "fuzzy oil drop" model in other publication. The "fuzzy oil drop" estimates the hydrophobicity distribution in the very simplified and averaged form. In result, the relative distribution of hydrophobicity is under consideration. The differences observed between different scales get marginal in this st...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.