2019
DOI: 10.1101/829481
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

AtacWorks: A deep convolutional neural network toolkit for epigenomics

Abstract: We introduce AtacWorks ( https://github.com/clara-genomics/AtacWorks ), a method to denoise and identify accessible chromatin regions from low-coverage or low-quality ATAC-seq data. AtacWorks uses a deep neural network to learn a mapping between noisy ATAC-seq data and corresponding higher-coverage or higher-quality data. To demonstrate the utility of AtacWorks, we train a model on data from four blood cell types and show that this model accurately denoises and identifies peaks from low-coverage bulk sequencin… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
25
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(26 citation statements)
references
References 33 publications
1
25
0
Order By: Relevance
“…In line with the previous assessment of a single punch from the mouse SSp, cell type separation can be distinct for major cell types when leveraging larger peak sets than the limited number that can be called on small cell count datasets. This supports the assertion that computational improvements to enable peak calling on low cell count datasets can substantially boost analytical power 29 .…”
Section: Spatial Trajectories Of Single-cell Atac-seq In the Human Cosupporting
confidence: 77%
“…In line with the previous assessment of a single punch from the mouse SSp, cell type separation can be distinct for major cell types when leveraging larger peak sets than the limited number that can be called on small cell count datasets. This supports the assertion that computational improvements to enable peak calling on low cell count datasets can substantially boost analytical power 29 .…”
Section: Spatial Trajectories Of Single-cell Atac-seq In the Human Cosupporting
confidence: 77%
“…We also achieve nearly linear scaling with our implementation on multiple sockets of Intel ® Xeon ® Cascade/Cooper/Ice Lake CPUs. We demonstrate that our execution on multiple CPU sockets is significantly faster than the published results for DGX-1 [20] box with 8 V100s [2] without any loss of accuracy. For a fair comparison with DGX-1 box, we use CPU systems with similar power envelop.…”
Section: Introductionmentioning
confidence: 73%
“…Subsequently, we scale our experiments by increasing the number of CPU sockets, dataset size, and ATAC-seq signal track size. We show multi-socket CPU scaling results and compare them with multi-GPU results published in [2].…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Our work demonstrates the feasibility of training such complex models (thousands of free parameters) on limited data sets (hundreds rather than thousands of samples), and we have tested that it can handle other data sets of much larger size (tens of thousands of samples, data not shown). It stands in contrast to other available implementations of neural networks for regulatory genomics, which are targeted to modeling epigenomic (39, 59, 60) and cistromic (36, 38) data, or do not explicitly model the dependence of sequence function on cellular descriptors such as TF levels (61). This feature allows CoNSEPT to make predictions for varying cellular conditions.…”
Section: Discussionmentioning
confidence: 99%