2019
DOI: 10.1109/tmm.2018.2887021
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Universal Attribute Modeling for Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(21 citation statements)
references
References 28 publications
0
21
0
Order By: Relevance
“…The best results of RGB-only models were R(2 + 1)D-RGB [53] and I3D-RGB [9], but they used more complex models with more params or FLOPs. We note that two-stream I3D [9] obtains 98.0% on UCF101 and 80.7% on HMDB51 top-1 accuracy, and some recent works on UCF101 [59] and HMDB51 [59], [60], [60]- [62] even achieve better results, but they are two-stream models with pre-computed optical flow, even more extra information such as audio signals.…”
Section: We Also Provide the Visualization Of Spatiotemporal Attentiomentioning
confidence: 77%
“…The best results of RGB-only models were R(2 + 1)D-RGB [53] and I3D-RGB [9], but they used more complex models with more params or FLOPs. We note that two-stream I3D [9] obtains 98.0% on UCF101 and 80.7% on HMDB51 top-1 accuracy, and some recent works on UCF101 [59] and HMDB51 [59], [60], [60]- [62] even achieve better results, but they are two-stream models with pre-computed optical flow, even more extra information such as audio signals.…”
Section: We Also Provide the Visualization Of Spatiotemporal Attentiomentioning
confidence: 77%
“…For the purpose of action recognition in surveillance scenes [57] proposes a Gaussian mixture model called Universal Attribute Modelling (UAM) using unsupervised learning approach. The UAM is also been used for facial expression recognition where it captures the attributes of all expressions [58] Further, for autonomous vehicles like cars or UAVs (Unmanned Aerial Vehicles) it is very essential to distinguish between normal and abnormal states.…”
Section: Unsupervised Learningmentioning
confidence: 99%
“…The overall framework should ensure that the nodes are not overloaded with work however if they are overloaded in the peak hours, the tasks should be partitioned and scheduled accordingly [106] [107]. Objects (Grassy Scene) AVIRIS [72] Objects (Aerial View) SkyEye [41] Objects (Aerial View) UCF101 [57] Individual (Action Videos) HMDB51 [57] Individual (Action Videos) THUMOS14 [57] [102] Individual (Action Videos) BP4D [58] Individual (Facial Expressions) AFEW [58] Individual (Facial Expressions) CAVIAR [67] Individuals, Objects (Vehicles) PASCAL VOC [6] [47] [69] [89] Objects (Vehicles) DS1, DS2 [8] Events (Fire/Smoke/Fog) SFpark [32] Object (Taxis) LISA 2010 [6] Objects (Vehicles) ImageNet [42] Events (Fire/Smoke/Fog) QMUL Junction [64] [49] Objects (Vehicles)…”
Section: Challenge 3: Quality Of Servicementioning
confidence: 99%
“…Action recognition is an important aspect of video content analysis and has been extensively studied over the last few years [16], [17], [18], [19], [20], [21]. Earlier methods are mostly based on hand-crafted visual features [22], [23].…”
Section: A Action Recognitionmentioning
confidence: 99%