BackgroundCytochrome P450 monooxygenases (CYPs) form a vast and diverse family of highly variable sequences. They catalyze a wide variety of oxidative reactions and are therefore of great relevance in drug development and biotechnological applications. Despite their differences in sequence and substrate specificity, the structures of CYPs are highly similar. Although being in research focus for years, factors mediating selectivity and activity remain vague.DescriptionThis systematic comparison of CYPs based on the Cytochrome P450 Engineering Database (CYPED) involved sequence and structure analysis of more than 8000 sequences. 31 structures have been applied to generate a reliable structure-based HMM profile in order to predict structurally conserved regions. Therefore, it was possible to automatically transfer these modules on CYP sequences without any secondary structure information, to analyze substrate interacting residues and to compare interaction sites with redox partners.ConclusionsFunctionally relevant structural sites of CYPs were predicted. Regions involved in substrate binding were analyzed in all sequences among the CYPED. For all CYPs that require a reductase, two reductase interaction sites were identified and classified according to their length. The newly gained insights promise an improvement of engineered enzyme properties for potential biotechnological application. The annotated sequences are accessible on the current version of the CYPED. The prediction tool can be applied to any CYP sequence via the web interface at http://www.cyped.uni-stuttgart.de/cgi-bin/strpred/dosecpred.pl.
BackgroundThe Lipase Engineering Database (LED) integrates information on sequence, structure and function of lipases, esterases and related proteins with the α/β hydrolase fold. A new superfamily for Candida antarctica lipase A (CALA) was introduced including the recently published crystal structure of CALA. Since CALA has a highly divergent sequence in comparison to other α/β hydrolases, the Lipase Engineering Database was used to classify CALA in the frame of the already established classification system. This involved the comparison of CALA to similar structures as well as sequence-based comparisons against the content of the LED.ResultsThe new release 3.0 (December 2009) of the Lipase Engineering Database contains 24783 sequence entries for 18585 proteins as well as 656 experimentally determined protein structures, including the structure of CALA. In comparison to the previous release [1] with 4322 protein and 167 structure entries this update represents a significant increase in data volume. By comparing CALA to representative structures from all superfamilies, a structure from the deacetylase superfamily was found to be most similar to the structure of CALA. While the α/β hydrolase fold is conserved in both proteins, the major difference is found in the cap region. Sequence alignments between both proteins show a sequence similarity of only 15%. A multisequence alignment of both protein families was used to create hidden Markov models for the cap region of CALA and showed that the cap region of CALA is unique among all other proteins of the α/β hydrolase fold. By specifically comparing the substrate binding pocket of CALA to other binding pockets of α/β hydrolases, the binding pocket of Candida rugosa lipase was identified as being highly similar. This similarity also applied to the lid of Candida rugosa lipase in comparison to the potential lid of CALA.ConclusionThe LED serves as a valuable tool for the systematic analysis of single proteins or protein families. The updated release 3.0 was used for the evaluation of α/β hydrolases. The HTML version of the database with new features is available at http://www.led.uni-stuttgart.de and provides sequences, structures and a set of analysis tools including phylogenetic trees and HMM profiles
BackgroundThiamine diphosphate (ThDP)-dependent enzymes form a vast and diverse class of proteins, catalyzing a wide variety of enzymatic reactions including the formation or cleavage of carbon-sulfur, carbon-oxygen, carbon-nitrogen, and especially carbon-carbon bonds. Although very diverse in sequence and domain organisation, they share two common protein domains, the pyrophosphate (PP) and the pyrimidine (PYR) domain. For the comprehensive and systematic comparison of protein sequences and structures the Thiamine diphosphate (ThDP)-dependent Enzyme Engineering Database (TEED) was established.DescriptionThe TEED http://www.teed.uni-stuttgart.de contains 12048 sequence entries which were assigned to 9443 different proteins and 379 structure entries. Proteins were assigned to 8 different superfamilies and 63 homologous protein families. For each family, the TEED offers multisequence alignments, phylogenetic trees, and family-specific HMM profiles. The conserved pyrophosphate (PP) and pyrimidine (PYR) domains have been annotated, which allows the analysis of sequence similarities for a broad variety of proteins. Human ThDP-dependent enzymes are known to be involved in many diseases. 20 different proteins and over 40 single nucleotide polymorphisms (SNPs) of human ThDP-dependent enzymes were identified in the TEED.ConclusionsThe online accessible version of the TEED has been designed to serve as a navigation and analysis tool for the large and diverse family of ThDP-dependent enzymes.
BackgroundStandard numbering schemes for families of homologous proteins allow for the unambiguous identification of functionally and structurally relevant residues, to communicate results on mutations, and to systematically analyse sequence-function relationships in protein families. Standard numbering schemes have been successfully implemented for several protein families, including lactamases and antibodies, whereas a numbering scheme for the structural family of thiamine-diphosphate (ThDP) -dependent decarboxylases, a large subfamily of the class of ThDP-dependent enzymes encompassing pyruvate-, benzoylformate-, 2-oxo acid-, indolpyruvate- and phenylpyruvate decarboxylases, benzaldehyde lyase, acetohydroxyacid synthases and 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexadiene-1-carboxylate synthase (MenD) is still missing.Despite a high structural similarity between the members of the ThDP-dependent decarboxylases, their sequences are diverse and make a pairwise sequence comparison of protein family members difficult.ResultsWe developed and validated a standard numbering scheme for the family of ThDP-dependent decarboxylases. A profile hidden Markov model (HMM) was created using a set of representative sequences from the family of ThDP-dependent decarboxylases. The pyruvate decarboxylase from S. cerevisiae (PDB: 2VK8) was chosen as a reference because it is a well characterized enzyme. The crystal structure with the PDB identifier 2VK8 encompasses the structure of the ScPDC mutant E477Q, the cofactors ThDP and Mg2+ as well as the substrate analogue (2S)-2-hydroxypropanoic acid. The absolute numbering of this reference sequence was transferred to all members of the ThDP-dependent decarboxylase protein family. Subsequently, the numbering scheme was integrated into the already established Thiamine-diphosphate dependent Enzyme Engineering Database (TEED) and was used to systematically analyze functionally and structurally relevant positions in the superfamily of ThDP-dependent decarboxylases.ConclusionsThe numbering scheme serves as a tool for the reliable sequence alignment of ThDP-dependent decarboxylases and the unambiguous identification and communication of corresponding positions. Thus, it is the basis for the systematic and automated analysis of sequence-encoded properties such as structural and functional relevance of amino acid positions, because the analysis of conserved positions, the identification of correlated mutations and the determination of subfamily specific amino acid distributions depend on reliable multisequence alignments and the unambiguous identification of the alignment columns. The method is reliable and robust and can easily be adapted to further protein families.
b Metallo--lactamases (MBLs) are enzymes that hydrolyze -lactam antibiotics, resulting in bacterial resistance to these drugs. These proteins have caused concerns due to their facile transference, broad substrate spectra, and the absence of clinically useful inhibitors. To facilitate the classification, nomenclature, and analysis of MBLs, an automated database system was developed, the Metallo--Lactamase Engineering Database (MBLED) (http://www.mbled.uni-stuttgart.de). It contains information on MBLs retrieved from the NCBI peptide database while strictly following the nomenclature by Jacoby and Bush (http://www.lahey.org /Studies/) and the generally accepted class B -lactamase (BBL) standard numbering scheme for MBLs. The database comprises 597 MBL protein sequences and enables systematic analyses of these sequences. A systematic analysis employing the database resulted in the generation of mutation profiles of assigned IMP-and VIM-type MBLs, the identification of five MBL protein entries from the NCBI peptide database that were inconsistent with the Jacoby and Bush nomenclature, and the identification of 15 new IMP candidates and 9 new VIM candidates. Furthermore, the database was used to identify residues with high mutation frequencies and variability (mutation hot spots) that were unexpectedly distant from the active site located in the  sandwich: positions 208 and 266 in the IMP family and positions 215 and 258 in the VIM family. We expect that the MBLED will be a valuable tool for systematically cataloguing and analyzing the increasing number of MBLs being reported.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.