The MSstats R-Bioconductor family of packages
is widely used for statistical analyses of quantitative bottom-up
mass spectrometry-based proteomic experiments to detect differentially
abundant proteins. It is applicable to a variety of experimental designs
and data acquisition strategies and is compatible with many data processing
tools used to identify and quantify spectral features. In the face
of ever-increasing complexities of experiments and data processing
strategies, the core package of the family, also named MSstats, has undergone a series of substantial updates.
The new version, MSstats v4.0, improves the usability,
versatility, and accuracy of the statistical methodology, as well as the use
of computational resources. New converters integrate the output of
upstream processing tools directly with MSstats,
requiring less manual work by the user. The package’s statistical
models have been updated to a more robust workflow. Finally, MSstats’ code has been substantially refactored to
improve memory use and computation speed. Here we detail these updates,
highlighting methodological differences between the new and old versions.
An empirical comparison of MSstats v4.0 to its previous
implementations, as well as to the packages MSqRob and DEqMS, on controlled mixtures and biological
experiments demonstrated stronger performance and better usability
of MSstats v4.0 compared with existing methods.
Liquid chromatography coupled with bottom-up mass spectrometry (LC-MS/MS)-based proteomics is a versatile technology for identifying and quantifying proteins in complex biological mixtures. After identification, analysis of changes in protein abundances between conditions requires increasingly complex and specialized statistical methods. Many of these methods, in particular the family of open-source Bioconductor packages MSstats, are implemented in a coding language such as R. To make the methods in MSstats accessible to users with limited programming and statistical background, we have created MSstatsShiny, an R-Shiny graphical user interface (GUI) integrated with MSstats, MSstatsTMT, and MSstatsPTM. The GUI provides a point-and-click analysis pipeline applicable to a wide variety of proteomics experiment types, including label-free data-dependent acquisitions (DDAs), data-independent acquisitions (DIAs), and tandem mass tag (TMT)-based DDAs (TMT-DDAs), and addresses questions such as relative changes in the abundance of peptides, proteins, or post-translational modifications (PTMs). To support reproducible research, the application saves the user's selections and builds an R script that programmatically recreates the analysis. MSstatsShiny can be installed locally via GitHub and Bioconductor, or used in the cloud at www.msstatsshiny.com. We illustrate the utility of the platform using two experimental data sets (MassIVE IDs MSV000086623 and MSV000085565).