We analysed the roles and distribution of metal ions in enzymatic catalysis using available public databases and our new resource Metal-MACiE (http://www.ebi.ac.uk/thornton-srv/databases/Metal_MACiE/home.html). In Metal-MACiE, a database of metal-based reaction mechanisms, 116 entries covering 21% of the metal-dependent enzymes and 70% of the types of enzyme-catalysed chemical transformations are annotated according to metal function. We used Metal-MACiE to assess the functions performed by metals in biological catalysis and the relative frequencies of different metals in different roles, which can be related to their individual chemical properties and availability in the environment. The overall picture emerging from the overview of Metal-MACiE is that redox-inert metal ions are used in enzymes to stabilize negative charges and to activate substrates by virtue of their Lewis acid properties, whereas redox-active metal ions can be used both as Lewis acids and as redox centres. Magnesium and zinc are by far the most common ions of the first type, while calcium is relatively less used. Magnesium, however, is most often bound to phosphate groups of substrates and interacts with the enzyme only transiently, whereas the other metals are stably bound to the enzyme. The most common metal of the second type is iron, which is prevalent in the catalysis of redox reactions, followed by manganese, cobalt, molybdenum, copper and nickel. The control of the reactivity of redox-active metal ions may involve their association with organic cofactors to form stable units. This occurs sometimes for iron and nickel, and quite often for cobalt and molybdenum.
Metalloproteins are proteins capable of binding one or more metal ions, which may be required for their biological function, or for regulation of their activities or for structural purposes. Genome sequencing projects have provided a huge number of protein primary sequences, but, even though several different elaborate analyses and annotations have been enabled by a rich and ever-increasing portfolio of bioinformatic tools, metal-binding properties remain difficult to predict as well as to investigate experimentally. Consequently, the present knowledge about metalloproteins is only partial. The present bioinformatic research proposes a strategy to answer the question of how many and which proteins encoded in the human genome may require zinc for their physiological function. This is achieved by a combination of approaches, which include: (i) searching in the proteome for the zinc-binding patterns that, on their turn, are obtained from all available X-ray data; (ii) using libraries of metal-binding protein domains based on multiple sequence alignments of known metalloproteins obtained from the Pfam database; and (iii) mining the annotations of human gene sequences, which are based on any type of information available. It is found that 1684 proteins in the human proteome are independently identified by all three approaches as zinc-proteins, 746 are identified by two, and 777 are identified by only one method. By assuming that all proteins identified by at least two approaches are truly zinc-binding and inspecting the proteins identified by a single method, it can be proposed that ca. 2800 human proteins are potentially zinc-binding in vivo, corresponding to 10% of the human proteome, with an uncertainty of 400 sequences. Available functional information suggests that the large majority of human zinc-binding proteins are involved in the regulation of gene expression. The most abundant class of zinc-binding proteins in humans is that of zinc-fingers, with Cys4 and Cys2His2 being the most common types of coordination environment.
Zinc is one of the metal ions essential for life, as it is required for the proper functioning of a large number of proteins. Despite its importance, the annotation of zinc-binding proteins in gene banks or protein domain databases still has significant room for improvement. In the present work, we compiled a list of known zinc-binding protein domains and of known zinc-binding sequence motifs (zinc-binding patterns), and then used them jointly to analyze the proteome of 57 different organisms to obtain an overview of zinc usage by archaeal, bacterial, and eukaryotic organisms. Zinc-binding proteins are an abundant fraction of these proteomes, ranging between 4% and 10%. The number of zinc-binding proteins correlates linearly with the total number of proteins encoded by the genome of an organism, but the proportionality constant of Eukaryota (8.8%) is significantly higher than that observed in Bacteria and Archaea (from 5% to 6%). Most of this enrichment is due to the larger portfolio of regulatory proteins in Eukaryota.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.