Despite decades of accumulated knowledge about proteins and their post-translational modifications (PTMs), numerous questions remain regarding their molecular composition and biological function. One of the most fundamental queries is the extent to which the combinations of DNA-, RNA- and PTM-level variations explode the complexity of the human proteome. Here, we outline what we know from current databases and measurement strategies including mass spectrometry-based proteomics. In doing so, we examine prevailing notions about the number of modifications displayed on human proteins and how they combine to generate the protein diversity underlying health and disease. We frame central issues regarding determination of protein-level variation and PTMs, including some paradoxes present in the field today. We use this framework to assess existing data and to ask the question, "How many distinct primary structures of proteins (proteoforms) are created from the 20,300 human genes?" We also explore prospects for improving measurements to better regularize protein-level biology and efficiently associate PTMs to function and phenotype.
Summary The quantitative concepts used to reason about gene regulation largely derive from bacterial studies. We show that this bacterial paradigm cannot explain the sharp expression of a canonical developmental gene in response to a regulating transcription factor (TF). In the absence of energy expenditure, with regulatory DNA at thermodynamic equilibrium, information integration across multiple TF binding sites can generate the required sharpness but with strong constraints on the resulting “higher-order cooperativities”. Even with such integration there is a “Hopfield barrier” to sharpness, represented, for n TF binding sites, by the Hill function with Hill coefficient n. If, however, energy is expended to maintain regulatory DNA away from thermodynamic equilibrium, as in kinetic proofreading, this barrier can be breached and greater sharpness achieved. Our approach is grounded in fundamental physics, leads to testable experimental predictions and suggests how a quantitative paradigm for eukaryotic gene regulation can be formulated.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.