2021
DOI: 10.1101/2021.03.31.437978
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Chromatin interaction aware gene regulatory modeling with graph attention networks

Abstract: Linking distal enhancers to genes and modeling their impact on target gene expression are longstanding unresolved problems in regulatory genomics and critical for interpreting non-coding genetic variation. Here we present a new deep learning approach called GraphReg that exploits 3D interactions from chromosome conformation capture assays in order to predict gene expression from 1D epigenomic data or genomic DNA sequence. By using graph attention networks to exploit the connectivity of distal elements and prom… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
35
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(35 citation statements)
references
References 29 publications
0
35
0
Order By: Relevance
“…Previous approaches typically modelled enhancer sequences explicitly via predefined sets of features, which were informed by prior biological knowledge 43 . In contrast, deep learning, in particular convolutional neural networks, do not require prior knowledge and can learn accurate models directly from raw data [44][45][46][47][48][49][50][51][52][53] . Once trained on raw data, these models allow the extraction and interpretation of the learned rules by novel types of tools 44,45,47,48,[54][55][56][57][58][59][60] .…”
mentioning
confidence: 99%
“…Previous approaches typically modelled enhancer sequences explicitly via predefined sets of features, which were informed by prior biological knowledge 43 . In contrast, deep learning, in particular convolutional neural networks, do not require prior knowledge and can learn accurate models directly from raw data [44][45][46][47][48][49][50][51][52][53] . Once trained on raw data, these models allow the extraction and interpretation of the learned rules by novel types of tools 44,45,47,48,[54][55][56][57][58][59][60] .…”
mentioning
confidence: 99%
“…In contrast, sequence-based methods such as 1D ResNet often focus on local sequence patterns and are not good at capturing interactions of residues that are far away from each other along the primary sequence. Studies have shown that explicitly modeling long-range residue interactions can greatly improve performance in various tasks [38] in addition to functional prediction. Using predicted structural information may also improve the prediction accuracy of GO terms with high specificity.…”
Section: Explicit Structural Information Amends Function Interpretation In Longer Sequences and High-specificity Go Termsmentioning
confidence: 99%
“…Graph attention network(GAT) [36] is a type of graph neural network (GNN) that performs graph convolution with self-attention [37]. GAT and GNN are used to model gene expression and study protein structure refinement [38,39]. Unsupervised protein sequence models are used to capture inter-residue relationships for protein contact prediction and have become an integral part of many protein structure prediction methods [23,24,32].…”
Section: Introductionmentioning
confidence: 99%
“…In the field of transcription regulation it has for example been shown that predicting experimental transcription factor binding profiles can elucidate the joint binding syntax of the pluripotency TFs (Avsec et al, 2021b) and that predicting gene expression can reveal gene-regulatory features such as long-range enhancer-promoter interactions (Avsec et al, 2021a;Karbalayghareh et al, 2021).…”
Section: Discussionmentioning
confidence: 99%
“…The model that ultimately solves this defining challenge will have to be able to predict gene expression up to measurement errors from sequence alone in any cell type, possibly even across organisms. Recently the first papers have been published that among other features predict gene expression from sequence in a multitask learning objective (Avsec et al, 2021a;Karbalayghareh et al, 2021). Until now, these proposed methods predict gene expression only for specific cell types.…”
Section: Discussionmentioning
confidence: 99%