2017
DOI: 10.7554/elife.27860
|View full text |Cite
|
Sign up to set email alerts
|

Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins

Abstract: Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation pa… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

13
168
1

Year Published

2017
2017
2021
2021

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 104 publications
(185 citation statements)
references
References 90 publications
13
168
1
Order By: Relevance
“…Publications of two draft human proteomes have recently provided protein evidence for the majority of previously-annotated protein-coding genes, in addition to some previously unknown proteins 30 31 . These studies and others 32,33 suggest that current reference annotation has not yet fully captured the protein-coding potential of the genome. To validate the coding potential of novel loci identified in this study, we searched the unmatched spectra from 30 human tissue/cell types (see Methods) against the novel predicted ORFs described in this study.…”
Section: Validation Using Mass Spectrometrymentioning
confidence: 95%
“…Publications of two draft human proteomes have recently provided protein evidence for the majority of previously-annotated protein-coding genes, in addition to some previously unknown proteins 30 31 . These studies and others 32,33 suggest that current reference annotation has not yet fully captured the protein-coding potential of the genome. To validate the coding potential of novel loci identified in this study, we searched the unmatched spectra from 30 human tissue/cell types (see Methods) against the novel predicted ORFs described in this study.…”
Section: Validation Using Mass Spectrometrymentioning
confidence: 95%
“…In recent studies, thousands of alternative proteins were experimentally detected in human cell lines (Vanderperre et al 2013;Samandi et al 2017;Brunet et al 2019). In P. patens, we found tens of thousands of sORFs (CDS-sORFs) that overlapped with the CDS of protein-coding genes.…”
Section: Translation Of Sorfs In Mossmentioning
confidence: 99%
“…After this sORF BED file was intersected with the latest moss genome annotation V3.3 , the locations of the sORFs on transcripts were determined, resulting in the further classification of genic-sORFs into uORFs, dORFs, CDS-sORFs, and interlaced-sORFs. sORFs were denoted as upstream or downstream if they were fully separated from the longer protein-coding ORF as previously described (Calviello et al 2016;Samandi et al 2017).…”
Section: Sorf Classificationmentioning
confidence: 99%
“…In recent studies, thousands of alternative proteins were experimentally detected in human cell lines (Vanderperre et al 2013; Samandi et al 2017). In P. patens , we found tens of thousands of sORFs (CDS-sORFs) that overlapped with the CDS of protein-coding genes, 305 of which were translatable.…”
Section: Discussionmentioning
confidence: 99%