2014
DOI: 10.1101/gr.159384.113
|View full text |Cite
|
Sign up to set email alerts
|

Comparative validation of the D. melanogaster modENCODE transcriptome annotation

Abstract: Accurate gene model annotation of reference genomes is critical for making them useful. The modENCODE project has improved the D. melanogaster genome annotation by using deep and diverse high-throughput data. Since transcriptional activity that has been evolutionarily conserved is likely to have an advantageous function, we have performed large-scale interspecific comparisons to increase confidence in predicted annotations. To support comparative genomics, we filled in divergence gaps in the Drosophila phyloge… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

8
214
0
1

Year Published

2015
2015
2019
2019

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 167 publications
(223 citation statements)
references
References 64 publications
8
214
0
1
Order By: Relevance
“…Overall, the distribution of consensus promoter widths observed are consistent with those determined in D. melanogaster using CAGE ( Figure 2A) (Hoskins et al 2011;Chen et al 2014). A proportion of the consensus promoter widths are long, including 1104 (10.4%) with widths longer than 30 bp (Figure 2A).…”
Section: Discussionsupporting
confidence: 70%
See 1 more Smart Citation
“…Overall, the distribution of consensus promoter widths observed are consistent with those determined in D. melanogaster using CAGE ( Figure 2A) (Hoskins et al 2011;Chen et al 2014). A proportion of the consensus promoter widths are long, including 1104 (10.4%) with widths longer than 30 bp (Figure 2A).…”
Section: Discussionsupporting
confidence: 70%
“…However, three of the four overrepresented dinucleotides (GA, GC, and GG) have guanines at 21, which is observed less commonly in metazoans. D. melanogaster, the most closely related species for which CAGE data are available (Hoskins et al 2011;Chen et al 2014), is enriched for YR at [21, +1]; no enrichment of dinucleotides with G at 21 is reported. In human, where core promoters tend to be GC-rich (Fitzgerald et al 2004(Fitzgerald et al , 2006, YR, but no GN dinucleotides, are enriched at initiation sites (Sandelin et al 2007;Frith et al 2008).…”
Section: Discussionmentioning
confidence: 99%
“…Overall, the expression profiles were similar to one another, with Spearman correlation coefficients of species pairs ranging between 0.68 and 0.90. Our expression data were also consistent with the previous studies of gene expression across multiple Drosophila species (Chen et al., 2014; Zhang, Sturgill, Parisi, Kumar & Oliver, 2007; Figure S2a).…”
Section: Resultsmentioning
confidence: 99%
“…We compared our expression data with two previous studies of gene expression across multiple Drosophila species (Chen et al., 2014; Zhang et al., 2007)—GSE99574 is based on RNA‐seq and GSE6640 is based on microarrays (herein referred to as “reference studies”). For simplicity, we used only “male replicate 1” in each reference study for whole‐body expression.…”
Section: Methodsmentioning
confidence: 99%
“…We used BLAST to identify orthologs in the D. suzukii assembly of Ometto et al. (2013), the unpublished draft D. subpulchrella assembly described above, a D. biarmipes assembly (Chen et al., 2014), and a second‐generation D. simulans assembly (Hu, Eisen, Thornton, & Andolfatto, 2013). Data for H2A and amylase were eliminated because H2A had multiple nonidentical paralogs in each species and homologs of D. melanogaster amylase could not be found in the assemblies.…”
Section: Methodsmentioning
confidence: 99%