2020
DOI: 10.1186/s13059-020-02032-0
|View full text |Cite
|
Sign up to set email alerts
|

Sampling time-dependent artifacts in single-cell genomics studies

Abstract: Robust protocols and automation now enable large-scale single-cell RNA and ATAC sequencing experiments and their application on biobank and clinical cohorts. However, technical biases introduced during sample acquisition can hinder solid, reproducible results, and a systematic benchmarking is required before entering large-scale data production. Here, we report the existence and extent of gene expression and chromatin accessibility artifacts introduced during sampling and identify experimental and computationa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
64
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 67 publications
(68 citation statements)
references
References 40 publications
4
64
0
Order By: Relevance
“…In an epigenetic analysis based on scATAC-seq, we analyzed the PBMC from SCPs, MPs, and HCs, taking three cases from each group. The clustering analysis identified 12 distinct clusters composed of T (CD3G), NK (NKG7), B (MS4A1), and monocyte cells (IL1B) by those signature genes ( 30 ) ( Figure 1A ). We further increased the resolution so that T cells were subclustered into CD8 + T cells and CD4 + T cells ( Figure 1A and Supplementary Figures 2A,B ).…”
Section: Resultsmentioning
confidence: 99%
“…In an epigenetic analysis based on scATAC-seq, we analyzed the PBMC from SCPs, MPs, and HCs, taking three cases from each group. The clustering analysis identified 12 distinct clusters composed of T (CD3G), NK (NKG7), B (MS4A1), and monocyte cells (IL1B) by those signature genes ( 30 ) ( Figure 1A ). We further increased the resolution so that T cells were subclustered into CD8 + T cells and CD4 + T cells ( Figure 1A and Supplementary Figures 2A,B ).…”
Section: Resultsmentioning
confidence: 99%
“…Our results encourage the use of mixed models, such as the two-part hurdle model with a random effect (e.g., as implemented in MAST with RE), as a way to account for repeated observations from an individual while being able to adjust for covariates at the individual level and, if appropriate, at the individual cell level. Additional random effects, such as sampling time, may also be included 35 . Our extensive simulation study provides valuable information for understanding the power of specific designs and can be used in grant reviews as one justification of the design and analyses employed.…”
Section: Discussionmentioning
confidence: 99%
“…We selected two PBMC datasets from the Ding datasets: the 10X human and the Dropseq human. We used PBMC datasets for this analysis because their cell-to-cell variability has been extensively studied using single-cell technologies as Fluorescence Activated Cell Sorting (FACS) and scRNA-seq [19, 20, 21, 22, 23]. Using these datasets, we measured the proportion of GO terms obtained in the output that were tightly related to the biological system under study.…”
Section: Resultsmentioning
confidence: 99%