Visualizing complex feature interactions and feature sharing in genomic deep neural networks

Liu, Ge; Zeng, Haoyang; Gifford, David K.

doi:10.1186/s12859-019-2957-4

Cited by 28 publications

(21 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, attribution methods explored here are first-order interpretability methods, defining the importance of individual nucleotides -not the importance of the entire motif on model predictions. Although recent progress is extending these class of methods to second-order attributions 53,[56][57][58] , they cannot uncover the effect size of motifs on model predictions. Global interpretability analysis via in silico experiments is one avenue that shows great promise in uncovering the importance of whole features 59 .…”

Section: Discussionmentioning

confidence: 99%

Improving representations of genomic sequence motifs in convolutional networks with exponential activations

Koo

Ploenzke

2020

Preprint

View full text Add to dashboard Cite

Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to learn distributed feature representations across many filters, making it challenging to decipher biologically meaningful representations, such as sequence motifs. Here we perform a comprehensive analysis on synthetic sequences to investigate the role that CNN activations have on model interpretability. We introduce a novel application of the exponential activation that when applied to first layer filters, consistently leads to interpretable and robust representations of motifs compared to other commonly used activations, both qualitatively and quantitatively. Strikingly, we demonstrate that CNNs with better test performance do not necessarily imply more interpretable representations with attribution methods. We find that CNNs with exponential activations significantly improve the efficacy of a CNN's ability to recover biologically meaningful representations with attribution methods. We demonstrate these results generalize to real DNA sequences across several in vivo datasets. Together, this work demonstrates how a small modification to existing CNNs, i.e. setting exponential activations in the first layer, can significantly improve the robustness and interpretabilty of learned representations directly in convolutional filters and indirectly with attribution methods. 3/18 4/18

show abstract

Section: Discussionmentioning

confidence: 99%

Improving representations of genomic sequence motifs in convolutional networks with exponential activations

Koo

Ploenzke

2020

Preprint

View full text Add to dashboard Cite

show abstract

“…The major drawback of DFIM is that it is computationally expensive: the interactions are inferred in a separate post-processing step and involves recalculation of network gradients. We note that the recent DeepResolve method infers feature importance and whether a feature participates in interactions with other features, but does not infer pairs of interacting features explicitly [Liu et al, 2019].…”

Section: Introductionmentioning

confidence: 96%

A Self-Attention Model for Inferring Cooperativity between Regulatory Features

Ullah

Ben‐Hur

2020

Preprint

View full text Add to dashboard Cite

Motivation:Deep learning has demonstrated its predictive power in modeling complex biological phenomena such as gene expression. The value of these models hinges not only on their accuracy, but also on the ability to extract biologically relevant information from the trained models. While there has been much recent work on developing feature attribution methods that provide the most important features for a given sequence, inferring cooperativity between regulatory elements, which is the hallmark of phenomena such as gene expression, remains an open problem. Results: We present SATORI, a Self-ATtentiOn based model to predict Regulatory element Interactions. Our approach combines convolutional and recurrent layers with a self-attention mechanism that helps us capture a global view of the landscape of interactions between regulatory elements in a sequence. We evaluate our method on simulated data and three complex datasets: human TAL1-GATA1 transcription factor ChIP-Seq, DNase I Hypersensitive Sites (DHSs) in human promoters across 164 cell lines, and genome-wide DNase I-Seq and ATAC-Seq peaks across 36 arabidopsis samples. In each of the three experiments SATORI identified numerous statistically significant TF interactions, many of which have been previously reported. Our approach not only provides a global, biologically relevant set of interactions but, unlike existing methods, it does not require a computationally expensive postprocessing step. Availability: The source code for SATORI is available at https://github.com/fahadahaf/SATORI.

show abstract

“…Recent progress has expanded the ability to probe interactions between putative motifs 37–39 . For instance, MaxEnt Interpretation uses Markov Chain Monte Carlo to sample sequences that produce a similar activation profile in the penultimate layer of the DNN 37 , allowing for downstream analysis of these sequences.…”

Section: Introductionmentioning

confidence: 99%

Global Importance Analysis: An Interpretability Method to Quantify Importance of Genomic Features in Deep Neural Networks

Koo

Ploenzke

Paul

et al. 2020

Preprint

View full text Add to dashboard Cite

Deep neural networks have demonstrated improved performance at predicting the sequence specificities of DNA- and RNA-binding proteins compared to previous methods that rely on k-mers and position weight matrices. For model interpretability, attribution methods have been employed to reveal learned patterns that resemble sequence motifs. First-order attribution methods only quantify the independent importance of single nucleotide variants in a given sequence – it does not provide the effect size of motifs (or their interactions with other patterns) on model predictions. Here we introduce global importance analysis (GIA), a new model interpretability method that quantifies the population-level effect size that putative patterns have on model predictions. GIA provides an avenue to quantitatively test hypotheses of putative patterns and their interactions with other patterns, as well as map out specific functions the network has learned. As a case study, we demonstrate the utility of GIA on the computational task of predicting RNA-protein interactions from sequence. We first introduce a new convolutional network, we call ResidualBind, and benchmark its performance against previous methods on RNAcompete data. Using GIA, we then demonstrate that in addition to sequence motifs, ResidualBind learns a model that considers the number of motifs, their spacing, and sequence context, such as RNA secondary structure and GC-bias.

show abstract

Visualizing complex feature interactions and feature sharing in genomic deep neural networks

Cited by 28 publications

References 29 publications

Improving representations of genomic sequence motifs in convolutional networks with exponential activations

Improving representations of genomic sequence motifs in convolutional networks with exponential activations

A Self-Attention Model for Inferring Cooperativity between Regulatory Features

Global Importance Analysis: An Interpretability Method to Quantify Importance of Genomic Features in Deep Neural Networks

Contact Info

Product

Resources

About