2023
DOI: 10.1093/nar/gkad436
|View full text |Cite
|
Sign up to set email alerts
|

A generalizable framework to comprehensively predict epigenome, chromatin organization, and transcriptome

Abstract: Many deep learning approaches have been proposed to predict epigenetic profiles, chromatin organization, and transcription activity. While these approaches achieve satisfactory performance in predicting one modality from another, the learned representations are not generalizable across predictive tasks or across cell types. In this paper, we propose a deep learning approach named EPCOT which employs a pre-training and fine-tuning framework, and is able to accurately and comprehensively predict multiple modalit… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 49 publications
0
5
0
Order By: Relevance
“…Ablation experiments with ChromaFold demonstrated that co-accessibility from scATAC-seq gave a significant performance improvement over pseudobulk accessibility alone. While a number of models including EPCOT 40 and C.Origami have relied on bulk ATAC-seq as an input signal to help generalization across cell types, our results suggest that covariation in scATAC-seq provides additional information that can be leveraged for contact map prediction. ChromaFold prediction accuracy improved when cell-type-specific CTCF ChIP-seq data was provided as an input.…”
Section: Discussionmentioning
confidence: 89%
“…Ablation experiments with ChromaFold demonstrated that co-accessibility from scATAC-seq gave a significant performance improvement over pseudobulk accessibility alone. While a number of models including EPCOT 40 and C.Origami have relied on bulk ATAC-seq as an input signal to help generalization across cell types, our results suggest that covariation in scATAC-seq provides additional information that can be leveraged for contact map prediction. ChromaFold prediction accuracy improved when cell-type-specific CTCF ChIP-seq data was provided as an input.…”
Section: Discussionmentioning
confidence: 89%
“…We used a deep learning framework that can predict epigenome, chromatin organization and transcription (EPCOT) (Z. Zhang et al, 2023) to impute high-resolution 3D chromatin contacts (Micro-C) using the endothelial ATAC profile. This approach predicted high contacts of the caSNP-caPeak region with the INHBB gene TSS, nominating the gene as a target ( Figure.…”
Section: Resultsmentioning
confidence: 99%
“…We used EPCOT (Z. Zhang et al, 2023) to impute the high-resolution 3D chromatin contact maps. EPCOT is a computational framework that predicts multiple genomic modalities using chromatin accessibility profiles and the reference genome sequence as input.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…By leveraging the knowledge encoded within the pre-trained Sei model, we were able to quickly train models with higher accuracy than our earlier models; furthermore, by directly using the predicted chromatin state variables predicted by Sei, we were able to use a simple logistic regression approach that yielded a highly interpretable model enabling us to make direct connections between transcription factors and their involvement in intron retention and discover a much larger set of transcription factors involved in this process. The methodology used here is similar to the one employed by the authors of the recent EPCOT model [32]. EPCOT is a chromatin state foundation model that has the added feature of using chromatin accessibility as part of its input which provides better generalization across cell-types.…”
Section: Introductionmentioning
confidence: 99%