Outer membrane proteins (OMPs) are the proteins in the surface of Gram-negative bacteria. These proteins have diverse functions but a single topology: the β-barrel. Sequence analysis has suggested that this common fold is a β-hairpin repeat protein, and that amplification of the β-hairpin has resulted in 8–26-stranded barrels. Using an integrated approach that combines sequence and structural analyses, we find events in which non-amplification diversification also increases barrel strand number. Our network-based analysis reveals strand-number-based evolutionary pathways, including one that progresses from a primordial 8-stranded barrel to 16-strands and further, to 18-strands. Among these pathways are mechanisms of strand number accretion without domain duplication, like a loop-to-hairpin transition. These mechanisms illustrate perpetuation of repeat protein topology without genetic duplication, likely induced by the hydrophobic membrane. Finally, we find that the evolutionary trace is particularly prominent in the C-terminal half of OMPs, implicating this region in the nucleation of OMP folding.
Metalloenzymes are 40% of all enzymes and can perform all seven classes of enzyme reactions. Because of the physicochemical similarities between the active sites of metalloenzymes and inactive metal binding sites, it is challenging to differentiate between them. Yet distinguishing these two classes is critical for the identification of both native and designed enzymes. Because of similarities between catalytic and non-catalytic metal binding sites, finding physicochemical features that distinguish these two types of metal sites can indicate aspects that are critical to enzyme function. In this work, we develop the largest structural dataset of enzymatic and non-enzymatic metalloprotein sites to date. We then use a decision-tree ensemble machine learning model to classify metals bound to proteins as enzymatic or non-enzymatic with 92.2% precision and 90.1% recall. Our model scores electrostatic and pocket lining features as more important than pocket volume, despite the fact that volume is the most quantitatively different feature between enzyme and non-enzymatic sites. Finally, we find our model has overall better performance in a side-to-side comparison against other methods that differentiate enzymatic from non-enzymatic sequences. We anticipate that our model’s ability to correctly identify which metal sites are responsible for enzymatic activity could enable identification of new enzymatic mechanisms and de novo enzyme design.
There are around 100 varieties of outer membrane proteins in each Gram-negative bacteria. All of these proteins have the same fold-an up-down β-barrel. It has been suggested that all membrane β-barrels excluding lysins are homologous. Here we suggest that β-barrels of efflux pumps have converged on this fold as well. By grouping structurally solved outer membrane β-barrels (OMBBs) by sequence we find that the membrane environment may have led to convergent evolution of the barrel fold. Specifically, the lack of sequence linkage to other barrels coupled with distinctive structural differences, such as differences in strand tilt and barrel radius, suggest that the outer membrane factor of efflux pumps evolutionarily converged on the barrel. Rather than being related to other OMBBs, sequence and structural similarity in the periplasmic region of the outer membrane factor of efflux pumps suggests an evolutionary link to the periplasmic subunit of the same pump complex.
There are around 100 types of integral outer membrane proteins in each Gram negative bacteria. All of these proteins have the same fold-an up-down β-barrel. It has been suggested that all membrane β-barrels other than lysins are homologous. Here we suggest that β-barrels of efflux pumps have converged on this fold as well. By grouping structurally-solved outer membrane β-barrels (OMBBs) by sequence we find evidence that the membrane environment may have led to convergent evolution of the barrel fold. Specifically, the lack of sequence linkage to other barrels coupled with distinctive structural differences, such as differences in strand tilt and barrel radius, suggest that efflux pumps have evolutionarily converged on the barrel. Finally, we find a possible ancestor for the OMBB efflux pumps as they are related to periplasmic components of the same pumps.
As a structural class, tight turns can control molecular recognition, enzymatic activity, and nucleation of folding. They have been extensively characterized in soluble proteins but have not been characterized in outer membrane proteins (OMPs), where they also support critical functions. We clustered the 4 to 6 residue tight turns of 110 OMPs to characterize the phi/psi angles, sequence, and hydrogen bonding of these structures. We find significant differences between reports of soluble protein tight turns and OMP tight turns. Since OMP strands are less twisted than soluble strands, they favor different turn structures types. Moreover, the membrane localization of OMPs yields different sequence hallmarks for their tight turns relative to soluble protein turns. We also characterize the differences in phi/psi angles, sequence, and hydrogen bonding between OMP extracellular loops and OMP periplasmic turns. As previously noted, the extracellular loops tend to be much longer than the periplasmic turns. We find that this difference in length is due to the broader distribution of lengths of the extracellular loops not a large difference in the median length. Extracellular loops also tend to have more charged residues as predicted by the charge-out rule. Finally, in all OMP tight turns, hydrogen bonding between the side chain and backbone 2 to 4 residues away from that side chain plays an important role. These bonds preferentially use an Asp, Asn, Ser, or Thr residue in a beta or pro phi/psi conformation. We anticipate that this study will be applicable to future design and structure prediction of OMPs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.