2021
DOI: 10.1155/2021/8890808
|View full text |Cite
|
Sign up to set email alerts
|

Attention‐Based Temporal Encoding Network with Background‐Independent Motion Mask for Action Recognition

Abstract: Convolutional neural network (CNN) has been leaping forward in recent years. However, the high dimensionality, rich human dynamic characteristics, and various kinds of background interference increase difficulty for traditional CNNs in capturing complicated motion data in videos. A novel framework named the attention-based temporal encoding network (ATEN) with background-independent motion mask (BIMM) is proposed to achieve video action recognition here. Initially, we introduce one motion segmenting approach o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 47 publications
(90 reference statements)
0
1
0
Order By: Relevance
“…Tu et al [ 55 ] proposed a combination of video object detection and motion saliency detection methods, which are based on pre-trained models from other datasets with extra labels to form a multi-stream neural network for action recognition. Weng et al [ 56 ] utilized boundaries and optical flow to generate background-independent motion masks for action recognition.…”
Section: Related Workmentioning
confidence: 99%
“…Tu et al [ 55 ] proposed a combination of video object detection and motion saliency detection methods, which are based on pre-trained models from other datasets with extra labels to form a multi-stream neural network for action recognition. Weng et al [ 56 ] utilized boundaries and optical flow to generate background-independent motion masks for action recognition.…”
Section: Related Workmentioning
confidence: 99%