2022
DOI: 10.1101/2022.05.23.493129
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A generalizable framework to comprehensively predict epigenome, chromatin organization, and transcriptome

Abstract: Many deep learning approaches have been proposed to connect DNA sequence, epigenetic profiles, chromatin organization and transcription activities. While these approaches achieve satisfactory performance in predicting one modality from another, the representations learned are not generalizable across predictive tasks or across cell types. In this paper, we propose a deep learning approach named EPCOT which employs a pre-training and fine-tuning framework, and comprehensively predicts epigenome, chromatin organ… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(14 citation statements)
references
References 83 publications
1
6
0
Order By: Relevance
“…We observe that the majority of model performance derives from the assay token, which leads to an increase in Pearson's r of approximately 50%. This is consistent with the aforementioned inter-assay distribution shifts and with prior work opting to use just an assay token [3] (see Section 4). Additionally, overall performance degrades with the inclusion of a tissue token, regardless of whether an assay token is used.…”
Section: Cgn Generalises Across Individuals Tissues Assays and Long-r...supporting
confidence: 87%
See 4 more Smart Citations
“…We observe that the majority of model performance derives from the assay token, which leads to an increase in Pearson's r of approximately 50%. This is consistent with the aforementioned inter-assay distribution shifts and with prior work opting to use just an assay token [3] (see Section 4). Additionally, overall performance degrades with the inclusion of a tissue token, regardless of whether an assay token is used.…”
Section: Cgn Generalises Across Individuals Tissues Assays and Long-r...supporting
confidence: 87%
“…Although multiple recent models have focussed on epigenetic signals [25,4], the closest approach to ours is that of Epcot [3] (see Table 1), where multiple epigenetic assay types are predicted by cross-attending to assay embeddings. The authors also augment the fixed nature of the genome across cells with an accessibility signal (DNase-seq/ATAC-seq), which can be interpreted as a continuous cell type representation, while multitask pre-training followed by fine-tuning is used for optimisation.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations