Improving representations of genomic sequence motifs in convolutional networks with exponential activations
Preprint, 2020
DOI: 10.1101/2020.06.14.150706

Abstract: Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to learn distributed feature representations across many filters, making it challenging to decipher biologically meaningful representations, such as sequence motifs. Here we perform a comprehensive analysis on synthetic sequences to investigate the role that CNN activations play in model interpretability. We introduce a novel application of the exponential activation that, when applied to first layer filters, consistently lea…
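To make the abstract's key idea concrete, here is a minimal sketch of a CNN over one-hot DNA sequences whose first convolutional layer uses an exponential activation while deeper layers use ReLU. The architecture, layer sizes, and sequence length are illustrative assumptions, not the authors' exact model.

```python
# Sketch: exponential activation on first-layer filters only (assumed layout).
import tensorflow as tf
from tensorflow.keras import layers

L = 200  # sequence length (assumed)

inputs = tf.keras.Input(shape=(L, 4))                    # one-hot encoded A/C/G/T
x = layers.Conv1D(32, 19, padding='same',
                  activation='exponential')(inputs)      # exp() on first-layer filters
x = layers.MaxPooling1D(25)(x)
x = layers.Conv1D(64, 7, padding='same', activation='relu')(x)  # deeper layers use ReLU
x = layers.GlobalMaxPooling1D()(x)
x = layers.Dense(64, activation='relu')(x)
outputs = layers.Dense(1, activation='sigmoid')(x)

model = tf.keras.Model(inputs, outputs)
```

Keras ships an `exponential` activation, so the swap from ReLU is a one-argument change in the first layer.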

Cited by 24 publications (44 citation statements: 2 supporting, 42 mentioning, 0 contrasting)
References 44 publications

Citation statements, ordered by relevance:
“…To gain insights into what DNN-based methods have learned, DLPRB visualizes filter representations while cDeepbind employs in silico mutagenesis. Filter representations are sensitive to network design choices [29, 30]; ResidualBind is not designed with the intention of learning interpretable filters. Hence, we opted to employ in silico mutagenesis, which systematically probes the effect size that each possible single nucleotide mutation in a given sequence has on model predictions.…”
Section: Results (mentioning)
confidence: 99%
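For reference, the in silico mutagenesis procedure described in the statement above can be sketched as follows; `model` is assumed to be any trained predictor that scores one-hot (L, 4) sequences, and all names are illustrative.

```python
# Sketch of in silico mutagenesis (ISM): score every single-nucleotide mutant.
import numpy as np

def in_silico_mutagenesis(model, onehot):
    """Return an (L, 4) array: change in prediction for every point mutation."""
    onehot = np.asarray(onehot, dtype=np.float32)
    L, A = onehot.shape
    wild_type = float(model.predict(onehot[None])[0])    # reference prediction
    effects = np.zeros((L, A))
    for pos in range(L):
        for alt in range(A):
            if onehot[pos, alt] == 1:                    # skip the wild-type base
                continue
            mutant = onehot.copy()
            mutant[pos, :] = 0.0
            mutant[pos, alt] = 1.0                       # substitute the alternative base
            effects[pos, alt] = float(model.predict(mutant[None])[0]) - wild_type
    return effects
```

In practice the 3L mutants would be scored in one batched call rather than one `predict` per mutant; the loop above just makes the "every possible single nucleotide mutation" logic explicit.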
“…For RBPs, this has been accomplished by visualizing first convolutional layer filters and via attribution methods [13, 18, 23, 24]. First layer filters have been shown to capture motif-like representations, but their efficacy depends strongly on the choice of model architecture [29], activation function [30], and training procedure [31]. First-order attribution methods, including in silico mutagenesis [13, 32] and other gradient-based methods [19, 33–36], are interpretability methods that identify the independent importance of single nucleotide variants in a given sequence toward model predictions, not the effect size of extended patterns such as sequence motifs.…”
Section: Introduction (mentioning)
confidence: 99%
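As a companion to the statement above, a first-order, gradient-based attribution (a simple saliency map) can be sketched like this for a TensorFlow model; the function name and tensor shapes are assumptions.

```python
# Sketch of a gradient-based, first-order attribution (saliency map).
import tensorflow as tf

def saliency_map(model, onehot):
    """Gradient of the model output with respect to each input nucleotide."""
    x = tf.convert_to_tensor(onehot[None], dtype=tf.float32)
    with tf.GradientTape() as tape:
        tape.watch(x)
        pred = model(x)[0]                       # prediction for one sequence
    return tape.gradient(pred, x)[0].numpy()     # shape (L, 4), per-base importance
```

Like in silico mutagenesis, this scores single-nucleotide contributions independently; neither method directly measures the effect of an extended motif as a unit, which is the limitation the statement points out.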
“…In comparison to most backpropagation-based methods that often use heuristic rules and approximations, ISM faithfully represents the model's response to mutations at individual positions. This makes it the method of choice when evaluating the effect of genetic variants on the output (Zhou and Troyanskaya, 2015; Zhou et al., 2018; Wesolowska-Andersen et al., 2020), and it is also used as a benchmark reference when evaluating the fidelity of other feature attribution methods (Koo and Ploenzke, 2020). Unlike ISM, backpropagation-based methods like DeepLIFT and Integrated Gradients rely on a predefined set of "neutral" input sequences that are used as explicit references to estimate attribution scores.…”
Section: Introduction (mentioning)
confidence: 99%
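To illustrate the baseline dependence this last statement describes, here is a minimal sketch of Integrated Gradients with an explicit "neutral" reference; the uniform 0.25 background baseline and all names are assumptions for illustration.

```python
# Sketch of Integrated Gradients with an explicit "neutral" baseline.
import numpy as np
import tensorflow as tf

def integrated_gradients(model, onehot, steps=50):
    """Attribution relative to an explicit baseline (uniform background)."""
    onehot = np.asarray(onehot, dtype=np.float32)
    baseline = np.full(onehot.shape, 0.25, dtype=np.float32)     # the "neutral" reference
    alphas = np.linspace(0.0, 1.0, steps, dtype=np.float32)[:, None, None]
    path = baseline[None] + alphas * (onehot - baseline)[None]   # inputs along the path
    x = tf.convert_to_tensor(path)
    with tf.GradientTape() as tape:
        tape.watch(x)
        preds = model(x)
    grads = tape.gradient(preds, x).numpy()
    return (onehot - baseline) * grads.mean(axis=0)              # shape (L, 4)
```

Changing the baseline (for example, all-zeros versus a uniform background) changes the attributions, which is exactly the reference dependence that ISM avoids.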