2020
DOI: 10.1038/s41598-020-67513-5
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing droplet-based single-nucleus RNA-seq resolution using the semi-supervised machine learning classifier DIEM

Abstract: Single-nucleus RnA sequencing (snRnA-seq) measures gene expression in individual nuclei instead of cells, allowing for unbiased cell type characterization in solid tissues. We observe that snRNA-seq is commonly subject to contamination by high amounts of ambient RNA, which can lead to biased downstream analyses, such as identification of spurious cell types if overlooked. We present a novel approach to quantify contamination and filter droplets in snRNA-seq experiments, called Debris Identification using Expec… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
61
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2
1

Relationship

1
9

Authors

Journals

citations
Cited by 77 publications
(61 citation statements)
references
References 37 publications
0
61
0
Order By: Relevance
“…We identified a total of 47894 high-quality, snRNAseq profiles. Debris-contaminated nuclei were removed with Seurat using cutoff on number of genes per nuclei or mitochondrial gene percentage, as well as using semi-supervised machine learning classifier Debris Identification using Expectation Maximization (DIEM)[42]. We classified nuclei into the cell types with SingleR (v 1.6.1) using DropViz Cerebellum MetaCells reference [43] ( Figure 1B ).…”
Section: Resultsmentioning
confidence: 99%
“…We identified a total of 47894 high-quality, snRNAseq profiles. Debris-contaminated nuclei were removed with Seurat using cutoff on number of genes per nuclei or mitochondrial gene percentage, as well as using semi-supervised machine learning classifier Debris Identification using Expectation Maximization (DIEM)[42]. We classified nuclei into the cell types with SingleR (v 1.6.1) using DropViz Cerebellum MetaCells reference [43] ( Figure 1B ).…”
Section: Resultsmentioning
confidence: 99%
“…The fastq files were then aligned using kallisto 122 , and bustools 123 summarized the cell/gene transcript counts in a matrix for each sample using the “lamanno” workflow for scRNAseq. Each library was then processed using DIEM 124 to eliminate debris and empty droplets. In parallel, “cellranger counts” was also used to align the scRNAseq reads using the STAR 125 aligner to produce the bam files necessary for demultiplexing the individual of origin, based on the genotype information using souporcell 126 and demuxlet 127 .…”
Section: Methodsmentioning
confidence: 99%
“…However, applying scRNA-seq technology to precious, archived human tissues, such as liver biopsies or resections, has proven to be challenging as it is not possible to dissociate intact cells from these existing biopsies of solid tissues. Single nucleus RNA sequencing (snRNA-seq) techniques [ 11 ] have overcome these technical challenges [ 12 ] and enabled cell-type level characterization of frozen solid tissues [ 13 16 ]. As scRNA-seq and snRNA-seq technologies improve, their use for solid tissues, such as liver, has expanded [ 17 , 18 ].…”
Section: Introductionmentioning
confidence: 99%