Alternating EM algorithm for a bilinear model in isoform quantification from RNA-seq data

Deng, Wenjiang; Mou, Tian; Kalari, Krishna R.; Niu, Nifang; Wang, Liewei; Pawitan, Yudi; Vu, Trung Nghia

doi:10.1093/bioinformatics/btz640

Cited by 12 publications

(20 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The FASTQ files were obtained from the three cohorts and RNA‐seq reads were aligned to the human genome hg19. Also, XAEM 12 was used to obtain the gene expression (transcript per million ‐ TPM) from the RNA‐seq data. The calculation process of XAEM followed the instructions provided at http://fafner.meb.ki.se/biostatwiki/xaem/.…”

Section: Methodsmentioning

confidence: 99%

The transcriptome‐wide landscape of molecular subtype‐specific mRNA expression profiles in acute myeloid leukemia

et al. 2021

Self Cite

View full text Add to dashboard Cite

Molecular classification of acute myeloid leukemia (AML) aids prognostic stratification and clinical management. Our aim in this study is to identify transcriptome‐wide mRNAs that are specific to each of the molecular subtypes of AML. We analyzed RNA‐sequencing data of 955 AML samples from three cohorts, including the BeatAML project, the Cancer Genome Atlas, and a cohort of Swedish patients to provide a comprehensive transcriptome‐wide view of subtype‐specific mRNA expression. We identified 729 subtype‐specific mRNAs, discovered in the BeatAML project and validated in the other two cohorts. Using unique proteomics data, we also validated the presence of subtype‐specific mRNAs at the protein level, yielding a rich collection of potential protein‐based biomarkers for the AML community. To enable the exploration of subtype‐specific mRNA expression by the broader scientific community, we provide an interactive resource to the public.

show abstract

Section: Methodsmentioning

confidence: 99%

The transcriptome‐wide landscape of molecular subtype‐specific mRNA expression profiles in acute myeloid leukemia

et al. 2021

Self Cite

View full text Add to dashboard Cite

show abstract

“…The joint estimation procedure is called an AEM algorithm, for which the exact formulas are given in Deng et al (2020) . At convergence, the output β represents the estimated transcript abundances for individual cells.…”

Section: Methodsmentioning

confidence: 99%

“…The key methodological innovation of Scasa is the in silico construction of the TCs, each with a corresponding initial design matrix X that adapts to the actual sequencing protocol used. As discussed previously ( Deng et al , 2020 ), X also automatically accounts for unknown biases in a sequencing protocol. Moreover, an explicitly available X makes the statistical processing of the paralogs (isoforms with highly similar sequences) tractable.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

Isoform-level quantification for single-cell RNA sequencing

et al. 2021

Self Cite

View full text Add to dashboard Cite

Motivation RNA expression at isoform level is biologically more informative than at gene level and can potentially reveal cellular subsets and corresponding biomarkers that are not visible at gene level. However, due to the strong 3’ bias sequencing protocol, mRNA quantification for high-throughput single-cell RNA sequencing such as Chromium Single Cell 3’ 10x Genomics is currently performed at the gene level. Results We have developed an isoform-level quantification method for high-throughput single-cell RNA sequencing by exploiting the concepts of transcription clusters and isoform paralogs. The method, called Scasa, compares well in simulations against competing approaches including Alevin, Cellranger, Kallisto, Salmon, Terminus and STARsolo at both isoform- and gene-level expression. The reanalysis of a CITE-Seq dataset with isoform-based Scasa reveals a subgroup of CD14 monocytes missed by gene-based methods. Availability Implementation of Scasa including source code, documentation, tutorials, and test data supporting this study is available at Github: https://github.com/eudoraleer/scasa and Zenodo: https://doi.org/10.5281/zenodo.5712503. Supplementary information Supplementary data are available at Bioinformatics online.

show abstract

“…The XAEM method ( Deng et al., 2019 ) adopts a bilinear model for transcript-level quantification that aims to perform multi-sample inference, considering evidence from multiple samples within the same RNA-seq experiment jointly when performing quantification. The model can be viewed as a generalization of more common transcript quantification models where the so-called “design” matrix is fixed, and inference solves for the maximum likelihood parameters under this design matrix and the observed sequences.…”

mentioning

confidence: 99%

Algorithms meet sequencing technologies – 10th edition of the RECOMB-Seq workshop

Patro

Salmela

2021

iScience

View full text Add to dashboard Cite

Alternating EM algorithm for a bilinear model in isoform quantification from RNA-seq data

Cited by 12 publications

References 27 publications

The transcriptome‐wide landscape of molecular subtype‐specific mRNA expression profiles in acute myeloid leukemia

The transcriptome‐wide landscape of molecular subtype‐specific mRNA expression profiles in acute myeloid leukemia

Isoform-level quantification for single-cell RNA sequencing

Algorithms meet sequencing technologies – 10th edition of the RECOMB-Seq workshop

Contact Info

Product

Resources

About