“…saliency maps ( Simonyan et al, 2013 ) and 4) higher-order interactions among sequence elements, which can be assessed e.g. by using association rule analysis ( Naulaerts et al, 2015 ; Zrimec et al, 2020 ), second-order perturbations ( Koo et al, 2018 ), self-attention networks ( Ullah and Ben-Hur, 2020 ) or by visualizing kernels in deeper layers ( Maslova et al, 2020 ) [interested readers are referred to ( Eraslan et al, 2019a ; Koo and Ploenzke, 2020a )]. Moreover, attention mechanisms were recently shown to be more effective in discovering known TF-binding motifs compared to non-attentive DNNs ( Park et al, 2020 ), as the learned attention weights correlate with informative inputs, such as DNase-Seq coverage and DNA motifs ( Chen et al, 2021 ), and they can provide better interpretation than other established feature visualization methods, such as saliency maps ( Lanchantin et al, 2016 ; Singh et al, 2017 ).…”