2019
DOI: 10.48550/arxiv.1908.01536
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Discriminating Spatial and Temporal Relevance in Deep Taylor Decompositions for Explainable Activity Recognition

Abstract: Current techniques for explainable AI have been applied with some success to image processing. The recent rise of research in video processing has called for similar work in deconstructing and explaining spatio-temporal models. While many techniques are designed for 2D convolutional models, others are inherently applicable to any input domain. One such body of work, deep Taylor decomposition, propagates relevance from the model output distributively onto its input and thus is not restricted to image processing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…This is exasperated as often there is no cohesion between frames leading to explanations produced this way to appear like noise. Hiley et al [11,12] have subsequently proposed a modification to these gradient-based techniques to make them more suitable to action recognition networks.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…This is exasperated as often there is no cohesion between frames leading to explanations produced this way to appear like noise. Hiley et al [11,12] have subsequently proposed a modification to these gradient-based techniques to make them more suitable to action recognition networks.…”
Section: Related Workmentioning
confidence: 99%
“…Subtracting the insertion score from 1 means that as both scores approach 0, the better they are. We compute the combined scores for w c = [18,20] and w g = [8,9,10,11,12]. These results are shown in Figure 2.…”
Section: Choice Of Weights For Swag-v I+gmentioning
confidence: 99%
“…Montavon et al proposed the layer-wise relevance propagation (LRP) to show the relevance between each pixel and the predicted class [7]; moreover, contrastive LRP further considers the effects of non-predicted classes to enhance the relevance to the targeted class [20]. Several works extend LRP for spatio temporal explainable methods (e.g., [3,21,22,43]). Hiley et al [21] surveyed a few frameworks that are used to explain action recognition models based on LRP or CAM approaches.…”
Section: Related Workmentioning
confidence: 99%
“…In this paper, we provide a deep analysis of temporal modeling for action recognition. Previous works focus on performance benchmark [10,54], spatiotemporal feature visualization [18,40] or salieny analysis [5,22,39,48] to gain better understanding of action models. For example, comprehensive studies of CNNbased models have been conducted recently in [10,54] to compare performance of different action models.…”
Section: Introductionmentioning
confidence: 99%