2019
DOI: 10.1186/s12859-018-2550-2
|View full text |Cite
|
Sign up to set email alerts
|

Computational discovery and annotation of conserved small open reading frames in fungal genomes

Abstract: Background: Small open reading frames (smORF/sORFs) that encode short protein sequences are often overlooked during the standard gene prediction process thus leading to many sORFs being left undiscovered and/or misannotated. For many genomes, a second round of sORF targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding for 80 amino acid residues or less from 31 fungal genomes. We then compared the predicted sORFs and analysed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(9 citation statements)
references
References 48 publications
0
9
0
Order By: Relevance
“…Furthermore, the annotation errors are often propagated between species and the more "draft" genomes we produce, the more errors we create and propagate [3][4][5]. Other important challenges that have attracted interest recently include the prediction of small proteins/peptides coded by short open reading frames (sORFs) [26,27] or the identification of events such as stop codon recoding [28]. These atypical proteins are often overlooked by the standard gene prediction pipelines, and their annotation requires dedicated methods or manual curation.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, the annotation errors are often propagated between species and the more "draft" genomes we produce, the more errors we create and propagate [3][4][5]. Other important challenges that have attracted interest recently include the prediction of small proteins/peptides coded by short open reading frames (sORFs) [26,27] or the identification of events such as stop codon recoding [28]. These atypical proteins are often overlooked by the standard gene prediction pipelines, and their annotation requires dedicated methods or manual curation.…”
Section: Introductionmentioning
confidence: 99%
“…Randomly occurring ORFs are unlikely to be preserved through evolution, whereas highly conserved regions are likely to be functional. While bioinformatic analyses have successfully identified functional proteins, many recently published studies report on thousands or even millions of conserved non‐annotated regions (Mat‐Sharani & Firdaus‐Raih, ). However, translated smORFs exhibit a lower degree of evolutionary conservation than known genes (Storz et al., ).…”
Section: Commentarymentioning
confidence: 99%
“…In the case of genome annotation, genes encoding small open reading frames are frequently missed. Mat-Sharani and Firdaus-Raih [10] have developed a generic annotation workflow for targeting these elusive regions and applied it successfully to a massive dataset of 31 fungal genomes.…”
Section: Sequence Analysismentioning
confidence: 99%