2019
DOI: 10.1038/s41589-019-0425-0
|View full text |Cite
|
Sign up to set email alerts
|

Accurate annotation of human protein-coding small open reading frames

Abstract: Functional protein-coding small open reading frames (smORFs) are emerging as an important class of genes. However, the number of translated smORFs in the human genome is unclear because proteogenomic methods are not sensitive enough, and, as we show, Ribo-Seq strategies require additional measures to ensure comprehensive and accurate smORF annotation. Here, we integrate de novo transcriptome assembly and Ribo-Seq into an improved workflow that

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

10
245
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 175 publications
(255 citation statements)
references
References 62 publications
10
245
0
Order By: Relevance
“…Our extensive MHC-I immunopeptidome proteomics analysis allowed us to validate that nuORFs are translated and presented, and provided ground truth for improving prediction quality. The >6,000 nuORF peptides we identified with stringent criteria as presented on MHC I, have similar biochemical and biophysical characteristics to peptides derived from annotated proteins, and dramatically expand the number of nuORFs detected by mass spectrometry Martinez et al 2019;Ma et al 2016Ma et al , 2014, particularly in primary cancer samples. As 50.6% of nuORFs were detected in more than one MS sample, they are recurrently translated and MHC Ipresented, with identical peptide sequences frequently detected in patient-derived cancer cell lines and in B721.221 mono-allelic cells expressing matching alleles, highlighting the robustness of nuORF prediction and the dependency of MHC I presentation on the HLA allele expressed in a given sample.…”
Section: Discussionmentioning
confidence: 77%
See 1 more Smart Citation
“…Our extensive MHC-I immunopeptidome proteomics analysis allowed us to validate that nuORFs are translated and presented, and provided ground truth for improving prediction quality. The >6,000 nuORF peptides we identified with stringent criteria as presented on MHC I, have similar biochemical and biophysical characteristics to peptides derived from annotated proteins, and dramatically expand the number of nuORFs detected by mass spectrometry Martinez et al 2019;Ma et al 2016Ma et al , 2014, particularly in primary cancer samples. As 50.6% of nuORFs were detected in more than one MS sample, they are recurrently translated and MHC Ipresented, with identical peptide sequences frequently detected in patient-derived cancer cell lines and in B721.221 mono-allelic cells expressing matching alleles, highlighting the robustness of nuORF prediction and the dependency of MHC I presentation on the HLA allele expressed in a given sample.…”
Section: Discussionmentioning
confidence: 77%
“…These nuORFs are derived from the 5' and 3' untranslated regions (UTRs), overlapping yet outof-frame alternative ORFs in annotated protein-coding genes, long non-coding RNAs (lncRNAs), pseudogenes and other transcripts currently annotated as non-protein coding (Fields et al 2015;Chew et al 2013). Ribo-seq analysis of HEK293T, HeLa-S3, and K562 cell lines and of human fibroblasts infected with HSV-1 and HCMV has identified translated nuORFs that contribute peptides to the MHC I immunopeptidome, suggesting that nuORFs could also have an immunological function Martinez et al 2019). However, a global understanding of the extent to which nuORFs contribute to the immunopeptidomes of healthy and cancer tissues, as well as the diversity and tissue specificity of nuORFs is still lacking.…”
Section: Introductionmentioning
confidence: 99%
“…Studies using microarrays, such as the IMI MARCAR Project [22] and Microarray Innovations in Leukemia (MILE) [23], have made great contributions to medical researches. We collected 617462 unique smORFs from SmProt [8], sORFs.org [7] and the study by Thomas et al [9]. Using probe reannotation, we remaped the probes of microarrays to smORFs and estimated smORF RNA expressions (Fig.…”
Section: Smorf Rna Quanti Cation Based On Microarraymentioning
confidence: 99%
“…The CEL les were processed using R package oligo (v1.48.0) [40] and ff (v2. [2][3][4][5][6][7][8][9][10][11][12][13][14]. Package ff was used with default parameters.…”
Section: Microarray Data Processingmentioning
confidence: 99%
See 1 more Smart Citation