A recurrent neural network (RNN) is one of the deep learning algorithms for sequence data analysis. In a conventional neural network, the inputs and outputs are independent of each other, whereas in an RNN the output of the previous step is fed as information to the current step. RNNs have many applications, including video sentiment classification, speech tagging, and machine translation. Recurrent networks also share parameters across the layers of the network, and several layers can be stacked to deepen the forward and backward flow of information in long short-term memory (LSTM) and gated recurrent unit (GRU) networks. This paper proposes two models for human action recognition, based on LSTM and GRU, respectively. The first model was improved by increasing the number of LSTM layers to four, with 128 cells in each layer. In the second model, the GRU was extended to two layers of 128 cells each, and the update and reset gates were modified based on the previous hidden state and the current input. A comparative study of the accuracy of both models was conducted on the UCF101 action dataset. Test results show accuracies of 95.19% for the improved LSTM and 92.90% for the improved GRU, a significant improvement over other state-of-the-art action recognition methods.
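To make the gating mechanism referred to above concrete, the following is a minimal NumPy sketch of one standard GRU step, in which the update gate z and reset gate r are computed from the current input and the previous hidden state. The dimensions, parameter initialization, and function names are illustrative assumptions, not the paper's actual implementation; the hidden size of 128 simply mirrors the 128-cell layers described in the abstract.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_cell(x_t, h_prev, params):
    """One GRU time step. The update (z) and reset (r) gates are
    functions of the current input x_t and previous hidden state h_prev."""
    W_z, U_z, W_r, U_r, W_h, U_h = params
    z = sigmoid(x_t @ W_z + h_prev @ U_z)               # update gate
    r = sigmoid(x_t @ W_r + h_prev @ U_r)               # reset gate
    h_tilde = np.tanh(x_t @ W_h + (r * h_prev) @ U_h)   # candidate state
    return (1.0 - z) * h_prev + z * h_tilde             # new hidden state

# Toy demo: input dimension 8, hidden dimension 128 (illustrative values).
rng = np.random.default_rng(0)
d_in, d_h = 8, 128
params = [rng.normal(scale=0.1, size=s)
          for s in [(d_in, d_h), (d_h, d_h)] * 3]
h = np.zeros(d_h)
for t in range(5):                                      # unroll 5 time steps
    h = gru_cell(rng.normal(size=d_in), h, params)
print(h.shape)
```

Because the candidate state is bounded by tanh and the gates interpolate between the old and candidate states, the hidden state remains numerically stable as the sequence is unrolled; stacking such cells (two layers in the paper's GRU model) means feeding each layer's hidden-state sequence as the input sequence of the next layer.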