2019
DOI: 10.1101/2019.12.17.880294
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Foster thy young: Enhanced prediction of orphan genes in assembled genomes

Abstract: The evolutionary rapid emergence of new genes gives rise to "orphan genes" that share no sequence homology to genes in closely related genomes. These genes provide organisms with a reservoir of genetic elements to quickly respond to changing selection pressures. Gene annotation pipelines that combine ab initio machine-learning with sequence homology-based searches are efficient in identifying basal genes with a long evolutionary history. However, their ability to identify orphan genes and other young genes has… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
3

Relationship

5
2

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 144 publications
(277 reference statements)
0
13
0
Order By: Relevance
“…The 26 NAM genomes were annotated using a hybrid evidence and ab initio based gene prediction pipeline (81). Evidence-based predictions were directly inferred from the assembled transcripts, which were generated using five different genome-guided transcript assembly RNA-seq reads from each library were mapped to their respective NAM genomes using STAR (v2.5.3a) (88) with an iterative 2-pass mapping approach in which splice junctions generated from the first round were used to refine alignments in the subsequent round.…”
Section: Gene Model Annotationmentioning
confidence: 99%
“…The 26 NAM genomes were annotated using a hybrid evidence and ab initio based gene prediction pipeline (81). Evidence-based predictions were directly inferred from the assembled transcripts, which were generated using five different genome-guided transcript assembly RNA-seq reads from each library were mapped to their respective NAM genomes using STAR (v2.5.3a) (88) with an iterative 2-pass mapping approach in which splice junctions generated from the first round were used to refine alignments in the subsequent round.…”
Section: Gene Model Annotationmentioning
confidence: 99%
“…A systematic analysis of current computational methods for genome annotation indicates many orphan genes may be missed in annotation projects (Li et al, 2021). This is because genes are often identified from sequenced genomes by combining evidence based on homology with other species (Meyer and Durbin, 2004;Proux-Wéra et al, 2012) with ab initio machine-learning predictions by detecting canonical sequence motifs (e.g., splice junctions) (Cantarel et al, 2008;Hoff et al, 2016).…”
Section: Introductionmentioning
confidence: 99%
“…e ., species-specific genes). As ab initio predictions of young genes can be unreliable (Seetharam et al, 2019), these were excluded. Finally, redundant copies of genes between direct evidence and ab initio predictions were identified and removed using Mikado compare (2.0rc2; (Venturini et al, 2018)) and merging was performed locus by locus, incorporating additional isoforms when necessary.…”
Section: Methodsmentioning
confidence: 99%