2020
DOI: 10.1016/j.neunet.2019.09.029
|View full text |Cite
|
Sign up to set email alerts
|

Multi-label zero-shot human action recognition via joint latent ranking embedding

Abstract: Human action recognition refers to automatic recognizing human actions from a video clip, which is one of the most challenging tasks in computer vision. Due to the fact that annotating video data is laborious and time-consuming, most of the existing works in human action recognition are limited to a number of small scale benchmark datasets where there are a small number of video clips associated with only a few human actions and a video clip often contains only a single action. In reality, however, there often… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 42 publications
(15 citation statements)
references
References 58 publications
0
15
0
Order By: Relevance
“…Human action/gesture recognition from both images and videos is one of the most challenging tasks in computer vision, due to a variety of actions that are not available among the seen action categories. In this regard, GZSLbased frameworks have been employed to recognize single label [24], [135], [159] and multi-label [158] human actions. As an example, a new multi-label ZSL (MZSL) framework using JLRE (Joint Latent Ranking Embedding) has been proposed in [158].…”
Section: Applicationsmentioning
confidence: 99%
See 1 more Smart Citation
“…Human action/gesture recognition from both images and videos is one of the most challenging tasks in computer vision, due to a variety of actions that are not available among the seen action categories. In this regard, GZSLbased frameworks have been employed to recognize single label [24], [135], [159] and multi-label [158] human actions. As an example, a new multi-label ZSL (MZSL) framework using JLRE (Joint Latent Ranking Embedding) has been proposed in [158].…”
Section: Applicationsmentioning
confidence: 99%
“…In this regard, GZSLbased frameworks have been employed to recognize single label [24], [135], [159] and multi-label [158] human actions. As an example, a new multi-label ZSL (MZSL) framework using JLRE (Joint Latent Ranking Embedding) has been proposed in [158]. The relatedness score of various action labels is measured for the test video clips in the semantic embedding and joint latent visual spaces.In addition, a multimodal framework using audio, video, and text has been introduced in [160], [161].…”
Section: Applicationsmentioning
confidence: 99%
“…are the standard deviation. 1 C , 2 C and 3 C are constants. The structure similarity function of the images are generated by combining the three variables:…”
Section: Structural Similar Stepwise Generative Recognizable Networkmentioning
confidence: 99%
“…Human action recognition is a hot topic in computer vision and pattern recognition [1], which has been applied to various fields, such as medical health [2], intelligent space [3], interactive entertainment [4], robotics [5] and so on. The action information can be obtained from medical equipment to help patients with rehabilitation training.…”
Section: Introductionmentioning
confidence: 99%
“…Multi-label image classification has been a hot topic in computer vision community. Its extensive applications include but are not limited to image retrieval, automatic image annotation, web image search and image tagging [1,2,3,4,5].…”
Section: Introductionmentioning
confidence: 99%