2019
DOI: 10.1101/2019.12.20.883983
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Interpretable machine learning models for single-cell ChIP-seq imputation

Abstract: Single-cell ChIP-seq analysis is challenging due to data sparsity. We present SIMPA (https://github.com/salbrec/SIMPA), a single-cell ChIP-seq data imputation method leveraging predictive information within bulk ENCODE data to impute missing protein- DNA interacting regions of target histone marks or transcription factors. Machine learning models trained for each single cell, each target, and each genomic region enable drastic improvement in cell types clustering and genes identification.The discovery of prote… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 39 publications
0
2
0
Order By: Relevance
“…ChIP-seq (chromatin immunoprecipitation followed by sequencing) is a widely used technique for mapping transcription factors, histone changes, and other protein-DNA interactions for genome-wide mapping (Furey 2012). ChIP-seq data is suffering from sparsity hence Albretch et al introduced SIMPA (Albrecht, Andreani et al 2021), a Single-cell ChIP-seq imputation algorithm that was tested on a scChIP-seq dataset of the H3K4me3 and H3K27me3 histone marks in B-cells and T-cells. Unlike most SC imputation approaches, SIMPA integrates the sparse input of one single cell with a series of 2,251 ENCODE ChIP-seq experiments to extract predictive information from bulk ChIP-seq data.…”
Section: Histone Modificationmentioning
confidence: 99%
See 1 more Smart Citation
“…ChIP-seq (chromatin immunoprecipitation followed by sequencing) is a widely used technique for mapping transcription factors, histone changes, and other protein-DNA interactions for genome-wide mapping (Furey 2012). ChIP-seq data is suffering from sparsity hence Albretch et al introduced SIMPA (Albrecht, Andreani et al 2021), a Single-cell ChIP-seq imputation algorithm that was tested on a scChIP-seq dataset of the H3K4me3 and H3K27me3 histone marks in B-cells and T-cells. Unlike most SC imputation approaches, SIMPA integrates the sparse input of one single cell with a series of 2,251 ENCODE ChIP-seq experiments to extract predictive information from bulk ChIP-seq data.…”
Section: Histone Modificationmentioning
confidence: 99%
“…ChIP-seq data is very sparse, therefore generally requiring imputation for more accurate analysis. Albretch et al introduced Single-cell ChIP-seq iMPutAtion (SIMPA) (Albrecht, Andreani et al 2021), an imputation algorithm that was tested on a single-cell ChIP-seq (scChIP-seq) dataset of the H3K4me3 and H3K27me3 histone marks in B-cells and T-cells. Unlike most SC imputation approaches, SIMPA integrates the sparse input of one single cell with a series of 2,251 ENCODE ChIP-seq experiments to extract predictive information from bulk ChIP-seq data.…”
Section: Histone Modificationmentioning
confidence: 99%