Elucidating the wiring diagram of the human cell is a central goal of the postgenomic era. We combined genome engineering, confocal live-cell imaging, mass spectrometry, and data science to systematically map the localization and interactions of human proteins. Our approach provides a data-driven description of the molecular and spatial networks that organize the proteome. Unsupervised clustering of these networks delineates functional communities that facilitate biological discovery. We found that remarkably precise functional information can be derived from protein localization patterns, which often contain enough information to identify molecular interactions, and that RNA binding proteins form a specific subgroup defined by unique interaction and localization properties. Paired with a fully interactive website (opencell.czbiohub.org), our work constitutes a resource for the quantitative cartography of human cellular organization.
Elucidating the wiring diagram of the human cell is one of the central goals of the post-genomic era. Here, we integrate genome engineering, confocal imaging, mass spectrometry and data science to systematically map protein localization in live cells and protein interactions under endogenous expression conditions. For this, we generated a library of 1,311 CRISPR-edited cell lines harboring fluorescent tags that also serve as handles for affinity capture, and applied a new machine learning framework to encode the interaction and localization profiles of each protein. Our approach provides a data-driven description of the molecular and spatial networks that organize the human proteome. We show that unsupervised clustering of these networks delineates functional groups and facilitates biological discovery, while hierarchical analyses uncover the core features that template cellular architecture. Furthermore, we discover that localization signatures are remarkably predictive of protein function, and often contain enough information to identify molecular interactions. Paired with a fully interactive website (opencell.czbiohub.org), OpenCell is a resource for the quantitative cartography of human cellular organization.
The yeastSaccharomyces cerevisiaeis a powerful model system for systems-wide biology screens and large-scale proteomics methods. Nearly complete proteomics coverage has been achieved owing to advances in mass spectrometry. However, it remains challenging to scale this technology for rapid and high-throughput analysis of the yeast proteome to investigate biological pathways on a global scale. Here we describe a systems biology workflow employing plate-based sample preparation and rapid, single-run, data-independent mass spectrometry analysis (DIA). Our approach is straightforward, easy to implement, and enables quantitative profiling and comparisons of hundreds of nearly complete yeast proteomes in only a few days. We evaluate its capability by characterizing changes in the yeast proteome in response to environmental perturbations, identifying distinct responses to each of them and providing a comprehensive resource of these responses. Apart from rapidly recapitulating previously observed responses, we characterized carbon source-dependent regulation of the GID E3 ligase, an important regulator of cellular metabolism during the switch between gluconeogenic and glycolytic growth conditions. This unveiled regulatory targets of the GID ligase during a metabolic switch. Our comprehensive yeast system readout pinpointed effects of a single deletion or point mutation in the GID complex on the global proteome, allowing the identification and validation of targets of the GID E3 ligase. Moreover, this approach allowed the identification of targets from multiple cellular pathways that display distinct patterns of regulation. Although developed in yeast, rapid whole-proteome–based readouts can serve as comprehensive systems-level assays in all cellular systems.
Recent advances in mass spectrometry (MS)-based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Whereas peptide identification is already scalable, most label-free quantification (LFQ) algorithms scale quadratic or cubic with the sample numbers, which may even preclude the analysis of large-scale data. Here we introduce directLFQ, a ratio-based approach for sample normalization and the calculation of protein intensities. It estimates quantities via aligning samples and ion traces by shifting them on top of each other in logarithmic space. Importantly, directLFQ scales linearly with the number of samples, allowing analyses of large studies to finish in minutes instead of days or months. We quantify 10,000 proteomes in 10 minutes and 100,000 proteomes in less than two hours - thousand-fold faster than some implementations of the popular LFQ algorithm MaxLFQ. In-depth characterization of directLFQ reveals excellent normalization properties and benchmark results, comparing favorably to MaxLFQ for both data-dependent acquisition (DDA) and data-independent acquisition (DIA). Additionally, directLFQ provides normalized peptide intensity estimates for peptide-level comparisons. It is available as an open-source Python package and as a GUI with a one-click installer and can be used in the AlphaPept ecosystem as well as downstream of most common computational proteomics pipelines.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.