Solubility of proteins on overexpression in Escherichia coli is a manifestation of the net effect of several sequence-dependent and sequence-independent factors. This study aims to delineate the relationship between the primary structure and solubility on overexpression. The amino acid sequences of proteins reported to be soluble or to form inclusion bodies on overexpression in E. coli under normal growth conditions were analyzed. The results show a positive correlation between thermostability and solubility of proteins, and an inverse correlation between the in vivo half-life of proteins and solubility. The amino acid (Asn, Thr, Tyr) composition and the tripeptide frequency of the protein were also found to influence its solubility on overexpression. The amino acids that were seen to be present in a comparatively higher frequency in inclusion body-forming proteins have a higher sheet propensity, whereas those that are seen more in soluble proteins have a higher helix propensity; this is indicative of a possible correlation between sheet propensity and inclusion body formation. Thus, the present analysis shows that thermostability, in vivo half-life, Asn, Thr, and Tyr content, and tripeptide composition of a protein are correlated to the propensity of a protein to be soluble on overexpression in E. coli. The precise mechanism by which these properties affect the solubility status of the overexpressed protein remains to be understood.Keywords: aliphatic index; instability index; solubility index; thermostability; tripeptide frequency; inclusion body Supplemental material: see www.proteinscience.org Overexpression of recombinant proteins is needed both in industry and in research for the production of proteins with pharmaceutical, structural, and/or biochemical relevance (Clark 1998). Escherichia coli, the preferred host for overexpressing recombinant proteins, is fast and inexpensive to cultivate, easy to handle and manipulate genetically, and generally yields high levels of recombinant proteins (Rudolph 1990; Clark 1998). However, a significantly large number of proteins form insoluble inclusion bodies when overexpressed in E. coli; these have to be solubilized and refolded to obtain the functional protein (Rudolph 1990;Rudolph and Lilie 1996;Pedelacq et al. 2002;Tsumoto et al. 2003;Tresaugues et al. 2004).Several factors have been suggested to contribute to inclusion body formation in E. coli: (1) high local concentration of the overexpressed protein; (2) reducing environment in the cytoplasm due to high levels of glutathione preventing disulfide bond formation (Makrides 1996;Lilie et al. 1998); (3) lack of posttranslational modifications such as glycosylation which could improve protein's solubility (Zhang et al. 1998); (4) improper interactions with chaperones and other proteins participating in folding in vivo (Mogk et al. 2002); (5) increased aggregation of folding intermediates due to their limited solubility (King et al. 1996); (6) intermolecular cross-linking via disulfideshowever, proteins witho...
Six physicochemical properties together with residue and dipeptide-compositions have been used to develop a support vector machine-based classifier to predict the overexpression status in E.coli. The prediction accuracy is approximately 72% suggesting that it performs reasonably well in predicting the propensity of a protein to be soluble or to form inclusion bodies. The algorithm could also correctly predict the change in solubility for most of the point mutations reported in literature. This algorithm can be a useful tool in screening protein libraries to identify soluble variants of proteins.
BackgroundGriseofulvin, an antifungal drug, has recently been shown to inhibit proliferation of various types of cancer cells and to inhibit tumor growth in athymic mice. Due to its low toxicity, griseofulvin has drawn considerable attention for its potential use in cancer chemotherapy. This work aims to understand how griseofulvin suppresses microtubule dynamics in living cells and sought to elucidate the antimitotic and antiproliferative action of the drug.MethodsThe effects of griseofulvin on the dynamics of individual microtubules in live MCF-7 cells were measured by confocal microscopy. Immunofluorescence microscopy, western blotting and flow cytometry were used to analyze the effects of griseofulvin on spindle microtubule organization, cell cycle progression and apoptosis. Further, interactions of purified tubulin with griseofulvin were studied in vitro by spectrophotometry and spectrofluorimetry. Docking analysis was performed using autodock4 and LigandFit module of Discovery Studio 2.1.ResultsGriseofulvin strongly suppressed the dynamic instability of individual microtubules in live MCF-7 cells by reducing the rate and extent of the growing and shortening phases. At or near half-maximal proliferation inhibitory concentration, griseofulvin dampened the dynamicity of microtubules in MCF-7 cells without significantly disrupting the microtubule network. Griseofulvin-induced mitotic arrest was associated with several mitotic abnormalities like misaligned chromosomes, multipolar spindles, misegregated chromosomes resulting in cells containing fragmented nuclei. These fragmented nuclei were found to contain increased concentration of p53. Using both computational and experimental approaches, we provided evidence suggesting that griseofulvin binds to tubulin in two different sites; one site overlaps with the paclitaxel binding site while the second site is located at the αβ intra-dimer interface. In combination studies, griseofulvin and vinblastine were found to exert synergistic effects against MCF-7 cell proliferation.ConclusionsThe study provided evidence suggesting that griseofulvin shares its binding site in tubulin with paclitaxel and kinetically suppresses microtubule dynamics in a similar manner. The results revealed the antimitotic mechanism of action of griseofulvin and provided evidence suggesting that griseofulvin alone and/or in combination with vinblastine may have promising role in breast cancer chemotherapy.
Galactose-binding proteins characterize an important subgroup of sugar-binding proteins that are involved in a variety of biological processes. Structural studies have shown that the Gal-specific proteins encompass a diverse range of primary and tertiary structures. The binding sites for galactose also seem to vary in different protein-galactose complexes. No common binding site features that are shared by the Gal-specific proteins to achieve ligand specificity are so far known. With the assumption that common recognition principles will exist for common substrate recognition, the present study was undertaken to identify and characterize any unique galactose-binding site signature by analyzing the three-dimensional (3D) structures of 18 protein-galactose complexes. These proteins belong to 7 nonhomologous families; thus, there is no sequence or structural similarity across the families. Within each family, the binding site residues and their relative distances were well conserved, but there were no similarities across families. A novel, yet simple, approach was adopted to characterize the binding site residues by representing their relative spatial dispositions in polar coordinates. A combination of the deduced geometrical features with the structural characteristics, such as solvent accessibility and secondary structure type, furnished a potential galactose-binding site signature. The signature was evaluated by incorporation into the program COTRAN to search for potential galactose-binding sites in proteins that share the same fold as the known galactose-binding proteins. COTRAN is able to detect galactose-binding sites with a very high specificity and sensitivity. The deduced galactose-binding site signature is strongly validated and can be used to search for galactose-binding sites in proteins. PROSITE-type signature sequences have also been inferred for galectin and C-type animal lectin-like fold families of Gal-binding proteins.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.