2010
DOI: 10.1093/nar/gkq1015
|View full text |Cite
|
Sign up to set email alerts
|

Accurate quantification of transcriptome from RNA-Seq data by effective length normalization

Abstract: We propose a novel, efficient and intuitive approach of estimating mRNA abundances from the whole transcriptome shotgun sequencing (RNA-Seq) data. Our method, NEUMA (Normalization by Expected Uniquely Mappable Area), is based on effective length normalization using uniquely mappable areas of gene and mRNA isoform models. Using the known transcriptome sequence model such as RefSeq, NEUMA pre-computes the numbers of all possible gene-wise and isoform-wise informative reads: the former being sequences mapped to a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
86
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 106 publications
(87 citation statements)
references
References 15 publications
1
86
0
Order By: Relevance
“…Available abundance estimation methods include direct computation (9,10) and model-based approaches. Many model-based studies (1,(11)(12)(13)(14) have used maximum-likelihood approaches to estimate isoform abundance.…”
mentioning
confidence: 99%
“…Available abundance estimation methods include direct computation (9,10) and model-based approaches. Many model-based studies (1,(11)(12)(13)(14) have used maximum-likelihood approaches to estimate isoform abundance.…”
mentioning
confidence: 99%
“…Population sizes for the common ancestors (> 8 kya and ~2 kya) of RJF and VC were obtained from our MSMC analysis. Since MSMC has a low power to estimate population size at relatively recent times, the effective population sizes for present day RJF and VC populations were taken from elsewhere [68] (1.6 × 10 5 and 4 × 10 5 , respectively). Generation time (g) and mutation rate per year (u) for chicken used here is 1 year and 1.91 × 10 −9 , respectively [69].…”
Section: Demographic History and Coalescent Simulationsmentioning
confidence: 99%
“…The expression level (FPKM) for each gene in each tissue was retrieved and transformed according to log 2 (FPKM + 1) [68]. The difference of expression level for each gene between VC and RJF was calculated using log 2 ((FPKM + 1) VC /(FPKM + 1) RJF ).…”
Section: Comparison Of Gene Expressionmentioning
confidence: 99%
“…Gene expression levels were estimated using an in-house developed application, which calculates fragments per kilobase of expressed exons per million mapped reads (FPKM values) in a manner similar to NEUMA. 32 In our approach, to calculate an effective length of genes, instead of using simulated data, we used a pooled set of aligned RNA-Seq reads for assessing genome mapability. Further analyses of gene expression results and generation of plots were performed in R (version 3.1.2), with the aid of "plyr" and "ggpot2" packages.…”
Section: Transcriptome Analysismentioning
confidence: 99%