2020
DOI: 10.1101/2020.02.03.930354
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SCSIM: Jointly simulating correlated single-cell and bulk next-generation DNA sequencing data

Abstract: Background: Recently, it has become possible to collect next-generation DNA sequencing data sets that are composed of multiple samples from multiple biological units where each of these samples may be from a single cell or bulk tissue. Yet, there does not yet exist a tool for simulating DNA sequencing data from such a nested sampling arrangement with single-cell and bulk samples so that developers of analysis methods can assess accuracy and precision. Results:We have developed a tool that simulates DNA sequenc… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 12 publications
(14 reference statements)
0
4
0
Order By: Relevance
“…For scalability analysis, simulated datasets are generated using the scsim Python package 56 , which is based on the Splatter statistical framework while it performs better efficiency to generate large scale simulated data. Following parameters are used in scsim() Python function: ngenes=25000, ncells=50000, 100000, 50000, 1000000, 2000000, 10000000, ngroups=5, diffexpprob= 0.025.…”
Section: Scalability Analysismentioning
confidence: 99%
“…For scalability analysis, simulated datasets are generated using the scsim Python package 56 , which is based on the Splatter statistical framework while it performs better efficiency to generate large scale simulated data. Following parameters are used in scsim() Python function: ngenes=25000, ncells=50000, 100000, 50000, 1000000, 2000000, 10000000, ngroups=5, diffexpprob= 0.025.…”
Section: Scalability Analysismentioning
confidence: 99%
“…There have been various read simulators built for single-cell genomic sequencing data. SimSCScTree[16], SCSIM[17], SCSsim[18] and CellCoal[19] are able to generate sequencing reads under single-cell resolution to benchmark analysis tools for single nucleotide variants(SNVs)[20] or copy number aberrations (CNAs)[21]. In contrast, few methods can simulate sequencing reads for single-cell epigenomic and transcriptomic data.…”
Section: Introductionmentioning
confidence: 99%
“…Yu et al developed SCSsim to produce SNVs, Indels, and CNVs, especially tackling the issue of allele dropout (ADO) and alleles unbalanced amplification frequently occurs in scDNA-seq [ 20 ]. SCSIM jointly mimics correlated single-cell and bulk DNA reads with SNVs [ 21 ]. Mallory et al developed SingleCellCNABenchmark which generates in silico single-cell reads with CNVs [ 22 ].…”
Section: Introductionmentioning
confidence: 99%
“…Evol. - - - - SCSsim [ 20 ] Bioinformatics - - SCSIM [ 21 ] BMC Bioinformatics - - - - SingleCellCNABenchmark [ 22 ] PLoS Comput. Biol - - - - SCSilicon - …”
Section: Introductionmentioning
confidence: 99%