2019
DOI: 10.1093/comjnl/bxz123
|View full text |Cite
|
Sign up to set email alerts
|

Improving Human Action Recognition Using Hierarchical Features And Multiple Classifier Ensembles

Abstract: This paper presents a simple, fast and efficacious system to promote the human action classification outcome using the depth action sequences. Firstly, the motion history mages (MHIs) and static history images (SHIs) are created from the front (XOY), side (YOZ) and top (XOZ) projected scenes of each depth sequence in a 3D Euclidean space through engaging the 3D Motion Trail Model (3DMTM). Then, the Local Binary Patterns (LBPs) algorithm is operated on the MHIs and SHIs to learn motion and static hierarchical f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 59 publications
0
8
0
Order By: Relevance
“…Although the approach in Ref. [24] and our proposed approach both represent the properties of action sequences from three views, the two approaches are quite different. Specifically, the approach in Ref.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Although the approach in Ref. [24] and our proposed approach both represent the properties of action sequences from three views, the two approaches are quite different. Specifically, the approach in Ref.…”
Section: Related Workmentioning
confidence: 99%
“…Specifically, the approach in Ref. [24] aims to obtain motion history images and static history images by calculating the pixel change of the same position in a time period, and the three views in Ref. [1] are front (XOY), side (YOZ), and top (XOZ) plane in a 3D Euclidean space (X-Y-Z) at each frame of a depth map sequence.…”
Section: Related Workmentioning
confidence: 99%
“…Classification accuracy (%) DCSF [54] 89.3 HON4D [15] 88.9 Super Normal Vector [16] 93.1 Skeletons Lie group [17] 89.5 DMM-LBP-DF [55] 93.0 2D-CNN on DMM-Pyramid [44] 91.1 3D-CNN on DMM-Cube [44] 86.1 HOG3D + LLC [56] 90.9 Hierarchical 3D Kernel [57] 92.7 GLAC on DMM [13] 89.4 DMM-GLAC-STACOG [13] 94.8 3DHoT + MBC [58] 95.2 Subspace encoding [59] 94.06 LSTM + trust gates [60] 94.8 Extended SNV [61] 93.5 Trust Gates [62] 94.8 ST-NBNN [63] 94.8 SSTKDes [64] 95.6 3D-CNN + DHI + relief + SVM [65] 92.8 WDMM + HOG [66] 91.9 WDMM + LBP [66] 91.6 WDMM + CNN [66] 90.0 Deep activations [67] 92.3 Deep activations + attributes [67] 93.4 Hierarchical Gaussian [68] 95.6 GMHI + GSHI + CRC [69] 94.5 MHF + SHF + KELM [36] 95.97 Spatiotemporal + HMM [70] 92. Experimental evaluation of our approach on UTD-MHAD dataset is represented by Table 5.…”
Section: Approachmentioning
confidence: 99%
“…The system is evaluated on a Desktop whose configuration includes an Intel i5-7500 Quad-core processor of 3.41 GHz frequency and a 16 GB RAM. There are six major components in the system-i.e., MHI/SHI construction, binary coded MHI generation, binary coded SHI generation, EAMF [39] 92.4 DMPP-PHOG [39] 95.0 DMM-LBP-DF [55] 91.3 Multi-temporal DMM [71] 95.44 3DHoT + MBC [58] 96.69 Hierarchical Gaussian [68] 97.96 MHF + SHF + MSVM [36] 96.09 MHF + SHF + KELM [36] 98 7. Note that the system needs less than one second (i.e., 731.43 ± 48.83 ms) to process 40 depth frames.…”
Section: Computational Timementioning
confidence: 99%
See 1 more Smart Citation