2020
DOI: 10.1007/978-3-030-58571-6_6
|View full text |Cite
|
Sign up to set email alerts
|

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
158
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 149 publications
(158 citation statements)
references
References 45 publications
0
158
0
Order By: Relevance
“…The proposed approach is evaluated on FCVID using the mean average precision (mAP) and compared against the top-scoring approaches of the literature, i.e. PivotCorrNN [15], LiteEval [30], AdaFrame [31], SCSampler [17], ST-VLAD [22] and AR-Net [19]. On YLI-MED, the top-1 accuracy is utilized, and the comparison is performed against the top-scoring literature approaches for this dataset, i.e.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…The proposed approach is evaluated on FCVID using the mean average precision (mAP) and compared against the top-scoring approaches of the literature, i.e. PivotCorrNN [15], LiteEval [30], AdaFrame [31], SCSampler [17], ST-VLAD [22] and AR-Net [19]. On YLI-MED, the top-1 accuracy is utilized, and the comparison is performed against the top-scoring literature approaches for this dataset, i.e.…”
Section: Resultsmentioning
confidence: 99%
“…In [17], SCSampler uses a lightweight saliency model to select the most salient temporal clips within a long video. In [19], the adaptive resolution network (AR-Net) selects on-the-fly the optimal frame resolution for classifying the video, outperforming the other methods in the FCVID dataset. In contrast to C2D approaches, C3D ones learn the space and time information jointly by exploiting 3D convolutions.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Zheng et al [ 64 ] used reinforcement learning agents to select effective segments for inference. Meng et al [ 65 ] proposed to use reinforcement learning to select the optimal resolution for each frame in the video input for effective action recognition in long untrimmed videos.…”
Section: Related Workmentioning
confidence: 99%