The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world’s largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.
MotivationBioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters).Availability and ImplementationThe software is freely available at github.com/BioContainers/.
The original PRIDE Inspector tool was developed as an open source standalone tool to enable the visualization and validation of mass-spectrometry (MS)-based proteomics data before data submission or already publicly available in the Proteomics Identifications (PRIDE) database. The initial implementation of the tool focused on visualizing PRIDE data by supporting the PRIDE XML format and a direct access to private (password protected) and public experiments in PRIDE.The ProteomeXchange (PX) Consortium has been set up to enable a better integration of existing public proteomics repositories, maximizing its benefit to the scientific community through the implementation of standard submission and dissemination pipelines. Within the Consortium, PRIDE is focused on supporting submissions of tandem MS data. The increasing use and popularity of the new Proteomics Standards Initiative (PSI) data standards such as mzIdentML and mzTab, and the diversity of workflows supported by the PX resources, prompted us to design and implement a new suite of algorithms and libraries that would build upon the success of the original PRIDE Inspector and would enable users to visualize and validate PX “complete” submissions. The PRIDE Inspector Toolsuite supports the handling and visualization of different experimental output files, ranging from spectra (mzML, mzXML, and the most popular peak lists formats) and peptide and protein identification results (mzIdentML, PRIDE XML, mzTab) to quantification data (mzTab, PRIDE XML), using a modular and extensible set of open-source, cross-platform libraries. We believe that the PRIDE Inspector Toolsuite represents a milestone in the visualization and quality assessment of proteomics data. It is freely available at http://github.com/PRIDE-Toolsuite/.
Filaminopathy is a subtype of myofibrillar myopathy caused by mutations in FLNC, the gene encoding filamin C, and histologically characterized by pathologic accumulation of several proteins within skeletal muscle fibers. With the aim to get new insights in aggregate composition, we collected aggregates and control tissue from skeletal muscle biopsies of six myofibrillar myopathy patients harboring three different FLNC mutations by laser microdissection and analyzed the samples by a label-free mass spectrometry approach. A total of 390 proteins were identified, and 31 of those showed significantly higher spectral indices in aggregates compared with patient controls with a ratio >1.8. These proteins included filamin C, other known myofibrillar myopathy associated proteins, and a striking number of filamin C binding partners. Across the patients the patterns were extremely homogeneous. Xin actin-binding repeat containing protein 2, heat shock protein 27, nebulin-related-anchoring protein, and Rab35 could be verified as new filaminopathy biomarker candidates. In addition, further experiments identified heat shock protein 27 and Xin actin-binding repeat containing protein 2 as novel filamin C interaction partners and we could show that Xin actin-binding repeat containing protein 2 and the known interaction partner Xin actinbinding repeat containing protein 1 simultaneously associate with filamin C. Ten proteins showed significant lower spectral indices in aggregate samples compared with patient controls (ratio <0.56) including M-band proteins myomesin-1 and myomesin-2. Proteomic findings were consistent with previous and novel immunolocalization data. Our findings suggest that aggregates in filaminopathy have a largely organized structure of proteins also interacting under physiological conditions. Different filamin C mutations seem to lead to almost identical aggregate compositions. The finding that filamin C was detected as highly abundant protein in aggregates in filaminopathy indicates that our proteomic approach may be suitable to identify new candidate genes among the many MFM patients with so far unknown mutation. Molecular & Cellular Proteomics
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.