2014
DOI: 10.1093/bioinformatics/btu757
|View full text |Cite
|
Sign up to set email alerts
|

WemIQ: an accurate and robust isoform quantification method for RNA-seq data

Abstract: We present a weighted-log-likelihood expectation maximization method on isoform quantification (WemIQ). WemIQ integrates an effective bias removal with a weighted expectation maximization (EM) algorithm to distribute reads among isoforms efficiently. The weight represents the oversampling or undersampling of sequence reads and is estimated through a generalized Poisson model without any presumption on the bias sources and formats. WemIQ significantly improves the quantification of isoform and gene expression a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
20
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(20 citation statements)
references
References 32 publications
0
20
0
Order By: Relevance
“…Compared to microarrays, RNA expression profile has two majorly advantageous features to enable to identify complexity of isoform variability on a single gene to produce various splicing events on transcripts and another nature, allelic specific expression and allelic imbalance resulted from genetic differences in transcriptional rates (Beretta et al, 2014;Bernard et al, 2014;Deng et al, 2011;Hiller et al, 2009;Hiller and Wong, 2013;Howard and Heber, 2010;Hu et al, 2014;Jiang and Wong, 2009;Katz et al, 2010;Kaur et al, 2012;Kimes et al, 2014;Leon-Novelo et al, 2014;Lerch et al, 2012;Li and Jiang, 2012;Ma and Zhang, 2013;Mezlini et al, 2013;Mills et al, 2013;Nariai et al, 2013;Nariai et al, 2014;Ng et al, 2014;Nicolae et al, 2011;Niu et al, 2014;Pandey et al, 2013;Patro et al, 2014;Rehrauer et al, 2013;Safikhani et al, 2013;Shi and Jiang, 2013;Suo et al, 2014;Trapnell et al, 2010;Vardhanabhuti et al, 2013;Wang et al, 2010;Wu et al, 2011b;Yalamanchili et al, 2014;Zhang et al, 2014;Zheng and Chen, 2009). …”
Section: Discussionmentioning
confidence: 99%
“…Compared to microarrays, RNA expression profile has two majorly advantageous features to enable to identify complexity of isoform variability on a single gene to produce various splicing events on transcripts and another nature, allelic specific expression and allelic imbalance resulted from genetic differences in transcriptional rates (Beretta et al, 2014;Bernard et al, 2014;Deng et al, 2011;Hiller et al, 2009;Hiller and Wong, 2013;Howard and Heber, 2010;Hu et al, 2014;Jiang and Wong, 2009;Katz et al, 2010;Kaur et al, 2012;Kimes et al, 2014;Leon-Novelo et al, 2014;Lerch et al, 2012;Li and Jiang, 2012;Ma and Zhang, 2013;Mezlini et al, 2013;Mills et al, 2013;Nariai et al, 2013;Nariai et al, 2014;Ng et al, 2014;Nicolae et al, 2011;Niu et al, 2014;Pandey et al, 2013;Patro et al, 2014;Rehrauer et al, 2013;Safikhani et al, 2013;Shi and Jiang, 2013;Suo et al, 2014;Trapnell et al, 2010;Vardhanabhuti et al, 2013;Wang et al, 2010;Wu et al, 2011b;Yalamanchili et al, 2014;Zhang et al, 2014;Zheng and Chen, 2009). …”
Section: Discussionmentioning
confidence: 99%
“…Due to the presence of numerous isoforms in existing annotations, inference on their abundance from RNA-seq reads has been an active field of research since 2009 (Jiang and Wong, 2009; Trapnell et al, 2010; Li et al, 2011; Zhang et al, 2014). A necessary step is to first map (or align) reads to reference genomes so that researchers know the numbers of reads generated from each exon.…”
Section: Introductionmentioning
confidence: 99%
“…Then, a common approach to summarize RNA-seq reads is to categorize the reads by the genomic regions to which they map so that the number of reads in different genomic regions can be used to distinguish the abundance of various isoforms. As different isoforms may consist of overlapping but not identical exons, many methods divide exons into subexons , which are defined as transcribed regions between every two adjacent splicing sites in annotations (Li et al, 2011; Zhang et al, 2014; Ye and Li, 2016). By this definition, every gene is composed of non-overlapping subexons and introns (i.e., non-transcribed genomic regions).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…POME incorporates the base-specific variation and between-base dependence that affect the read coverage profile throughout each transcript (Hu et al, 2012). WemIQ assigns different weights to reads from different gene regions when calculating the weighted log-likelihood (Zhang et al, 2015). All these aforementioned methods use the Poisson distribution to model the read counts and treat the bias values as weight factors to the Poisson rate.…”
Section: Introductionmentioning
confidence: 99%