2019
DOI: 10.1093/bioinformatics/btz640
|View full text |Cite
|
Sign up to set email alerts
|

Alternating EM algorithm for a bilinear model in isoform quantification from RNA-seq data

Abstract: Motivation Estimation of isoform-level gene expression from RNA-seq data depends on simplifying assumptions, such as uniform read distribution, that are easily violated in real data. Such violations typically lead to biased estimates. Most existing methods provide bias correction step(s), which is based on biological considerations—such as GC content—and applied in single samples separately. The main problem is that not all biases are known. R… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

4
2

Authors

Journals

citations
Cited by 12 publications
(20 citation statements)
references
References 27 publications
0
20
0
Order By: Relevance
“…The FASTQ files were obtained from the three cohorts and RNA‐seq reads were aligned to the human genome hg19. Also, XAEM 12 was used to obtain the gene expression (transcript per million ‐ TPM) from the RNA‐seq data. The calculation process of XAEM followed the instructions provided at http://fafner.meb.ki.se/biostatwiki/xaem/.…”
Section: Methodsmentioning
confidence: 99%
“…The FASTQ files were obtained from the three cohorts and RNA‐seq reads were aligned to the human genome hg19. Also, XAEM 12 was used to obtain the gene expression (transcript per million ‐ TPM) from the RNA‐seq data. The calculation process of XAEM followed the instructions provided at http://fafner.meb.ki.se/biostatwiki/xaem/.…”
Section: Methodsmentioning
confidence: 99%
“…The joint estimation procedure is called an AEM algorithm, for which the exact formulas are given in Deng et al (2020) . At convergence, the output β represents the estimated transcript abundances for individual cells.…”
Section: Methodsmentioning
confidence: 99%
“…The key methodological innovation of Scasa is the in silico construction of the TCs, each with a corresponding initial design matrix X that adapts to the actual sequencing protocol used. As discussed previously ( Deng et al , 2020 ), X also automatically accounts for unknown biases in a sequencing protocol. Moreover, an explicitly available X makes the statistical processing of the paralogs (isoforms with highly similar sequences) tractable.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The XAEM method ( Deng et al., 2019 ) adopts a bilinear model for transcript-level quantification that aims to perform multi-sample inference, considering evidence from multiple samples within the same RNA-seq experiment jointly when performing quantification. The model can be viewed as a generalization of more common transcript quantification models where the so-called “design” matrix is fixed, and inference solves for the maximum likelihood parameters under this design matrix and the observed sequences.…”
mentioning
confidence: 99%