2016
DOI: 10.1093/nar/gkw210
|View full text |Cite
|
Sign up to set email alerts
|

Coding exon-structure aware realigner (CESAR) utilizes genome alignments for accurate comparative gene annotation

Abstract: Identifying coding genes is an essential step in genome annotation. Here, we utilize existing whole genome alignments to detect conserved coding exons and then map gene annotations from one genome to many aligned genomes. We show that genome alignments contain thousands of spurious frameshifts and splice site mutations in exons that are truly conserved. To overcome these limitations, we have developed CESAR (Coding Exon-Structure Aware Realigner) that realigns coding exons, while considering reading frame and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
63
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
3
3
3

Relationship

4
5

Authors

Journals

citations
Cited by 49 publications
(65 citation statements)
references
References 58 publications
(62 reference statements)
2
63
0
Order By: Relevance
“…In the red deer ( Cervus elaphus ), which possesses a functional VNS, we found a confirmed frameshifting mutation in the last exon (Table ); however, this mutation is far downstream of the last transmembrane domain therefore just leads to a different C‐terminal protein sequence (Figure ). Such C‐terminal alterations are a common evolutionary change in conserved genes (Sharma et al, ). Further supporting that Trpc2 is a functional gene in Cervus elaphus , we estimated Trpc2 to evolve under purifying selection in this species without any evidence for significantly relaxed selection rates ( p ‐value > .1; Table ).…”
Section: Resultsmentioning
confidence: 99%
“…In the red deer ( Cervus elaphus ), which possesses a functional VNS, we found a confirmed frameshifting mutation in the last exon (Table ); however, this mutation is far downstream of the last transmembrane domain therefore just leads to a different C‐terminal protein sequence (Figure ). Such C‐terminal alterations are a common evolutionary change in conserved genes (Sharma et al, ). Further supporting that Trpc2 is a functional gene in Cervus elaphus , we estimated Trpc2 to evolve under purifying selection in this species without any evidence for significantly relaxed selection rates ( p ‐value > .1; Table ).…”
Section: Resultsmentioning
confidence: 99%
“…The approach performs a series of filter steps to remove artifacts related to genome assembly or alignment and evolutionary exonintron structure changes in conserved genes. These steps comprise excluding those deletions that overlap assembly gaps in a query genome (131), re-aligning all coding exons with CESAR to detect evolutionary splice site shifts and to avoid spurious frameshifts due to alignment ambiguities (132,133), and excluding alignments to paralogs or processed pseudogenes. Finally, the approach considers all principal or alternative APPRIS isoforms of a gene (134) and outputs data for the isoform with the smallest number of inactivating mutations.…”
Section: Detecting Genes Lost In Cetaceans During the Transition Frommentioning
confidence: 99%
“…All specimens were blinded for measurements as described above. Using the Coding Exon-Structure Aware Realigner (CESAR) 9 we aligned exons from jerboa (JacJac1.0, NCBI) and mouse (mm10, NCBI) genomes to generate a custom 1:1 orthologous transcript annotation set with no paralogs. Briefly, we first generated a whole genome alignment between the mouse and jerboa genome assemblies 81 .…”
Section: Skeletal Preparations and Measurementsmentioning
confidence: 99%