2022
DOI: 10.1371/journal.pone.0262181
|View full text |Cite
|
Sign up to set email alerts
|

A deep neural network model for multi-view human activity recognition

Abstract: Multiple cameras are used to resolve occlusion problem that often occur in single-view human activity recognition. Based on the success of learning representation with deep neural networks (DNNs), recent works have proposed DNNs models to estimate human activity from multi-view inputs. However, currently available datasets are inadequate in training DNNs model to obtain high accuracy rate. Against such an issue, this study presents a DNNs model, trained by employing transfer learning and shared-weight techniqu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 57 publications
0
2
0
Order By: Relevance
“…Convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have recently emerged as promising candidates for human recognition thanks to advancements in deep learning [7]. These methods can learn hierarchical representations of images and videos that can be used to recognize activities.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have recently emerged as promising candidates for human recognition thanks to advancements in deep learning [7]. These methods can learn hierarchical representations of images and videos that can be used to recognize activities.…”
Section: Related Workmentioning
confidence: 99%
“…Human interaction recognition recognizes and classifies human activities using visual data, such as images or videos, to recognize and organize them. Regarding recognizing human-human interactions (HHI) in computer vision, researchers have proposed various methods that use machine learning classifiers such as Random Forest, Support Vector Machine (SVM), Decision Trees, and HMM to recognize and categorize human interactions, such as shake_hands, hugs, and high-fives, into distinctly different classes [7,8]. These methods extract handcrafted elements, including spatialtemporal information, posture, and gesture, from images or videos and use classifiers to recognize and classify interactions.…”
Section: Image-based Hirmentioning
confidence: 99%
“…Deep learning is advantageous in the sense that it performs automatic feature extraction, which is particularly useful when dealing with challenging datasets. The paper [85], proposed a Deep Neural Network (DNN) consisting of a combination of Convolutional Neural Networks (CNN), attention layers, LSTM, and softmax layers to improve the accuracy of predictions and for short-term predicting. The attention layer selectively concentrates on the important parts of the input sequence, while the softmax is the activation function that normalizes the outputs.…”
Section: The Approaches Used In Hap/harmentioning
confidence: 99%
“…In this work, we take inspiration from the multi-view human action recognition (MVHAR) field to address the problem of animal behavior recognition, particularly the WDS behavior in rats. There are two main approaches for MVHAR; the first one is to train end-to-end neural networks ( Putra et al, 2018 , 2022 ; Wang et al, 2018 ; Vyas et al, 2020 ), but the performance is tied to the use of large MVHAR datasets ( Weinland et al, 2006 ; Gkalelis et al, 2009 ; Wang et al, 2014 ; Shahroudy et al, 2016 ; Guo et al, 2022 ). Equivalent animal datasets are not available, and producing them would be costly.…”
Section: Introductionmentioning
confidence: 99%