2019
DOI: 10.1371/journal.pone.0218073
|View full text |Cite
|
Sign up to set email alerts
|

Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively parallel reporter assays

Abstract: The relationship between noncoding DNA sequence and gene expression is not well-understood. Massively parallel reporter assays (MPRAs), which quantify the regulatory activity of large libraries of DNA sequences in parallel, are a powerful approach to characterize this relationship. We present MPRA-DragoNN, a convolutional neural network (CNN)-based framework to predict and interpret the regulatory activity of DNA sequences as measured by MPRAs. While our method is generally applicable to a variety of MPRA desi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
69
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 78 publications
(71 citation statements)
references
References 67 publications
2
69
0
Order By: Relevance
“…Thus, while cis effects can generally be attributed to the disruption of strong activating motifs, in rarer cases, cis effects are due to the disruption of weak repressive motifs. While this may reflect real biological effects, it may also be due to the fact that MPRAs are more powered to detect activators over repressors [ 29 , 32 ].
Fig.
…”
Section: Resultsmentioning
confidence: 99%
“…Thus, while cis effects can generally be attributed to the disruption of strong activating motifs, in rarer cases, cis effects are due to the disruption of weak repressive motifs. While this may reflect real biological effects, it may also be due to the fact that MPRAs are more powered to detect activators over repressors [ 29 , 32 ].
Fig.
…”
Section: Resultsmentioning
confidence: 99%
“…Deep learning has been widely deployed in genomics and systems biology over the last few years ( Alipanahi et al , 2015 ; Avsec et al , 2019 ; Celesti et al , 2017 ; Cuperus et al , 2017 ; Greenside et al , 2018 ; Koh et al , 2017 ; Libbrecht and Noble, 2015 ; Movva et al , 2019 ; Nair et al , 2019 ; Pouladi et al , 2015 ; Rui et al , 2007 ; Shen et al , 2018 ). Many of the developed tools have been highly successful in classification problems such as the identification of binding sites, open regions of chromatin and the location of enhancers.…”
Section: Introductionmentioning
confidence: 99%
“…While recent work such as soft explainable decision trees 34 have enabled researchers to look inside this "black-box" by using a neural network to train a decision tree, we chose to visualize weights and activations of our trained model directly 35 . Taking inspiration from work in the fields of image recognition and genomics 23,[36][37][38] , we "un-boxed" the first convolutional layer to visualize the features our model deemed important by interpreting the filter weights learned from input sequences as sequence logos ( Fig. 2a).…”
Section: Resultsmentioning
confidence: 99%