2020
DOI: 10.1101/2020.05.05.078345
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A semi-supervised Bayesian approach for simultaneous protein sub-cellular localisation assignment and novelty detection

Abstract: The cell is compartmentalised into complex micro-environments allowing an array of specialised biological processes to be carried out in synchrony. Determining a protein's sub-cellular localisation to one or more of these compartments can therefore be a rst step in determining its function. High-throughput and high-accuracy mass spectrometry-based sub-cellular proteomic methods can now shed light on the localisation of thousands of proteins at once. Machine learning algorithms are then typically employed to ma… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
7

Relationship

6
1

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 131 publications
(212 reference statements)
0
13
0
Order By: Relevance
“…1d) 25 . The latter takes into account the uncertainty that arises when classifying proteins that reside in multiple locations, or unknown functional compartments and also those that dynamically move within the cell, and so provides a richer overall analysis of spatial proteomics data [25][26][27][28] .…”
Section: Resultsmentioning
confidence: 99%
“…1d) 25 . The latter takes into account the uncertainty that arises when classifying proteins that reside in multiple locations, or unknown functional compartments and also those that dynamically move within the cell, and so provides a richer overall analysis of spatial proteomics data [25][26][27][28] .…”
Section: Resultsmentioning
confidence: 99%
“…For example, Crook et al propose a semi-supervised Bayesian approach and uncover a novel group of Saccharomyces cerevisiae proteins trafficking from the ER to the early Golgi apparatus. [52] Additionally, differential localization could also be elucidated using joint models across conditions, with uncertainty quantification to assist in ranking candidates for future experimental investigation. Similarly, the relationship between post-translational modification and differential localization could be examined by comparing profiles for the modified and unmodified protein forms.…”
Section: Quantifying Uncertainty With Bayesian Modelingmentioning
confidence: 99%
“…Multivariate data analysis is conducted using the MS proteomics packages MSnbase 29 and pRoloc 30 , which provide a robust framework for processing, visualisation, and interrogation of spatial proteomics data as part of the open-source, open-development Bioconductor 31,32 suite of R software 33 . Additional modalities for phenotype discovery 34,35 , transfer learning from heterogeneous data sources 36 , assessment of cluster separation as a data resolution metric 37 and probabilistic classifiers such as Bayesian mixture modelling 38 have recently been integrated into the pRoloc pipeline.…”
mentioning
confidence: 99%