2021
DOI: 10.1371/journal.pgen.1009585
|View full text |Cite
|
Sign up to set email alerts
|

Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach

Abstract: Small proteins play essential roles in bacterial physiology and virulence, however, automated algorithms for genome annotation are often not yet able to accurately predict the corresponding genes. The accuracy and reliability of genome annotations, particularly for small open reading frames (sORFs), can be significantly improved by integrating protein evidence from experimental approaches. Here we present a highly optimized and flexible bioinformatics workflow for bacterial proteogenomics covering all steps fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
41
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

2
7

Authors

Journals

citations
Cited by 27 publications
(45 citation statements)
references
References 72 publications
4
41
0
Order By: Relevance
“…While this is challenging for eukaryotes due to their larger genome size, efficient approaches were developed to better characterize the coding potential of prokaryotic genomes [ 1 , 2 ]. More recently, proteogenomics has been increasingly employed to discover small proteins, since short open reading frames typically are difficult to predict by conventional genome annotation tools [ 3 , 4 ]. Proteogenomic applications were also extended to the study of gene expression regulation at transcript and protein levels or to the identification of cancer-specific protein variants [ 5 , 6 ].…”
Section: Introductionmentioning
confidence: 99%
“…While this is challenging for eukaryotes due to their larger genome size, efficient approaches were developed to better characterize the coding potential of prokaryotic genomes [ 1 , 2 ]. More recently, proteogenomics has been increasingly employed to discover small proteins, since short open reading frames typically are difficult to predict by conventional genome annotation tools [ 3 , 4 ]. Proteogenomic applications were also extended to the study of gene expression regulation at transcript and protein levels or to the identification of cancer-specific protein variants [ 5 , 6 ].…”
Section: Introductionmentioning
confidence: 99%
“…We also considered recently published mass spectrometry data for S. aureus ( 41 ) and detected smORFs with SALT & Pepper pipeline ( https://gitlab.com/s.fuchs/pepper ) ( 41 ). The pipeline uses a minimum of one unique peptide larger than 6 amino acids detected in at least two biological replicates and with a minimum score of 40 for unmodified and modified peptides, a minimum delta score of 6 for unmodified peptides and 17 for modified peptides, and a fixed false discovery rate (FDR) of 0.0001 for peptides and 0.01 for proteins ( 41 ). The SALT & Pepper pipeline comprises genomic prediction of smORFs and mass spectrometry verification and detected in total 176 unique small proteins with a length of up to 100 amino acids.…”
Section: Resultsmentioning
confidence: 99%
“…The Ribo-Seq data for S. aureus Newman were uploaded in GEO under accession number GSE150601. Mass spectrometry data for S. aureus are from ( 41 ) and for E. coli from ( 42 ).…”
Section: Methodsmentioning
confidence: 99%
“…In addition, the translation of thousands of small open reading frames (smORFs; <100 codons) located on long non-coding RNAs (lncRNAs) or mRNAs was confirmed experimentally and, therefore, is another source of peptides in cellular and secreted peptidomes [ 22 , 23 , 24 , 25 , 26 , 27 , 28 ]. However, the abundance of these groups of peptides, their half-life and degradation mechanisms are still poorly understood [ 29 , 30 , 31 , 32 ].…”
Section: Introductionmentioning
confidence: 99%