2020
DOI: 10.1101/2020.09.17.302182
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Protein Abundance Prediction Through Machine Learning Methods

Abstract: Proteins are responsible for most physiological processes, and their abundance provides crucial information for systems biology research. However, absolute protein quantification, as determined by mass spectrometry, still has limitations in capturing the protein pool. Protein abundance is impacted by translation kinetics, which rely on features of codons. In this study, we evaluated the effect of codon usage bias of genes on protein abundance. Notably, we observed differences regarding codon usage patterns bet… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 63 publications
0
5
0
Order By: Relevance
“…Standard codon usage metrics were shown to be highly predictive of protein abundance. For instance, an AdaBoost model trained on a number of codon usage metrics in S. cerevisiae genes coding for high-abundance proteins (top 10%) and low-abundance proteins (lowest 10%) was highly predictive of these extremes of protein abundance ( R 2 = 0.95) ( Ferreira et al, 2020 ).…”
Section: Regulatory Mechanisms In Specific Coding and Non-coding Regionsmentioning
confidence: 99%
“…Standard codon usage metrics were shown to be highly predictive of protein abundance. For instance, an AdaBoost model trained on a number of codon usage metrics in S. cerevisiae genes coding for high-abundance proteins (top 10%) and low-abundance proteins (lowest 10%) was highly predictive of these extremes of protein abundance ( R 2 = 0.95) ( Ferreira et al, 2020 ).…”
Section: Regulatory Mechanisms In Specific Coding and Non-coding Regionsmentioning
confidence: 99%
“…Another approach, developed by Terai and Asai (2020), uses features such as the accessibility around the Shine-Dalgarno sequence, minimum free energy of the mRNA molecule, Viterbi score, and inside-outside score. Further, Ferreira et al (2021) explored codon usage bias information to train an AdaBoost regression model, achieving higher correlations than previous approaches without the usage of transcriptomics data.…”
Section: Parrot: Prediction Of Enzyme Abundances Using Protein-constr...mentioning
confidence: 99%
“…Moreover, issues may arise from errors in the installation, configuration, or use of 'competitor' frameworks. Typical examples are misunderstanding memory management and/or using insufficient compute resources (Balaji and Allen, 2018), or failing to use comparable resource budgets (Ferreira et al, 2021).…”
Section: The Need For Standardized Benchmarksmentioning
confidence: 99%