CORUM is a database that provides a manually curated repository of experimentally characterized protein complexes from mammalian organisms, mainly human (67%), mouse (15%) and rat (10%). Given the vital functions of these macromolecular machines, their identification and functional characterization is foundational to our understanding of normal and disease biology. The new CORUM 3.0 release encompasses 4274 protein complexes offering the largest and most comprehensive publicly available dataset of mammalian protein complexes. The CORUM dataset is built from 4473 different genes, representing 22% of the protein coding genes in humans. Protein complexes are described by a protein complex name, subunit composition, cellular functions as well as the literature references. Information about stoichiometry of subunits depends on availability of experimental data. Recent developments include a graphical tool displaying known interactions between subunits. This allows the prediction of structural interconnections within protein complexes of unknown structure. In addition, we present a set of 58 protein complexes with alternatively spliced subunits. Those were found to affect cellular functions such as regulation of apoptotic activity, protein complex assembly or define cellular localization. CORUM is freely accessible at http://mips.helmholtz-muenchen.de/corum/.
The CORUM database has been providing comprehensive reference information about experimentally characterized, mammalian protein complexes and their associated biological and biomedical properties since 2007. Given that most catalytic and regulatory functions of the cell are carried out by protein complexes, their composition and characterization is of greatest importance in basic and disease biology. The new CORUM 4.0 release encompasses 5204 protein complexes offering the largest and most comprehensive publicly available dataset of manually curated mammalian protein complexes. The CORUM dataset is built from 5299 different genes, representing 26% of the protein coding genes in humans. Complex information from 3354 scientific articles is mainly obtained from human (70%), mouse (16%) and rat (9%) cells and tissues. Recent curation work includes sets of protein complexes, Functional Complex Groups, that offer comprehensive collections of published data in specific biological processes and molecular functions. In addition, a new graphical analysis tool was implemented that displays co-expression data from the subunits of protein complexes. CORUM is freely accessible at http://mips.helmholtz-muenchen.de/corum/.
Instability of simple DNA repeats has been known as a common cause of hereditary ataxias for over 20 years. Routine genetic diagnostics of these phenotypically similar diseases still rely on an iterative workflow for quantification of repeat units by PCR-based methods of limited precision. We established and validated clinical nanopore Cas9-targeted sequencing (Clin-CATS), an amplification-free method for simultaneous analysis of ten repeat loci associated with clinically overlapping hereditary ataxias. The method combines target enrichment by CRISPR/Cas9, Oxford Nanopore long-read sequencing, and a bioinformatics pipeline utilizing the tools STRIque and Megalodon for parallel detection of length, sequence, methylation, and composition of the repeat loci. Clin-CATS allowed for the precise and parallel analysis of 10 repeat loci associated with adult-onset ataxia and revealed additional parameter such as FMR1 promotor methylation and repeat sequence required for diagnosis at the same time. Using Clin-CATS we analyzed 100 clinical samples of undiagnosed ataxia patients and identified causative repeat expansions in 28 patients. Parallel repeat analysis enabled a molecular diagnosis of ataxias independent of preconceptions based on clinical presentation. Biallelic expansions within RFC1 were identified as the most frequent cause of ataxia. We characterized the RFC1 repeat composition of all patients and identified a novel repeat motif, AGGGG. Our results highlight the power of Clin-CATS as a readily expandable workflow for the in-depth analysis and diagnosis of phenotypically overlapping repeat expansion disorders.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.