2021
DOI: 10.1016/j.patrec.2021.02.013
|View full text |Cite
|
Sign up to set email alerts
|

Action recognition using kinematics posture feature on 3D skeleton joint locations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 47 publications
(16 citation statements)
references
References 23 publications
0
16
0
Order By: Relevance
“…As can be seen from Figure 1 , the deep learning-based sport recognition model for mental health status is composed of a data layer, a logic layer, and a display layer. The data layer is used to obtain people’s sport data, real-time sport data, and health identification data and transmit the above data into the logic layer ( Ahad et al, 2021 ). After using the logic layer to perform mutual information feature extraction, exercise health pattern recognition, and exercise intensity logic calculation on people’s exercise data and health data, the relevant calculation results are inputted into the display layer to provide users with exercise project management, exercise intensity viewing, and exercise health model and other information, so that users can fully understand the effect of current exercise on their mental health.…”
Section: Recognition and Analysis Of Sports On Mental Healthmentioning
confidence: 99%
“…As can be seen from Figure 1 , the deep learning-based sport recognition model for mental health status is composed of a data layer, a logic layer, and a display layer. The data layer is used to obtain people’s sport data, real-time sport data, and health identification data and transmit the above data into the logic layer ( Ahad et al, 2021 ). After using the logic layer to perform mutual information feature extraction, exercise health pattern recognition, and exercise intensity logic calculation on people’s exercise data and health data, the relevant calculation results are inputted into the display layer to provide users with exercise project management, exercise intensity viewing, and exercise health model and other information, so that users can fully understand the effect of current exercise on their mental health.…”
Section: Recognition and Analysis Of Sports On Mental Healthmentioning
confidence: 99%
“…Early Feature Fusion. Although neural networks can autonomously learn data features, many studies have indicated that early feature processing can improve the performances of models, so it is necessary to select distinctive features [44][45][46][47][48]. For example, inspired by the Lie group-based skeleton descriptor [44], Jiang et al [16] proposed a spatiotemporal skeleton transformation descriptor (ST-STD) to define the relative transformations of skeleton gestures, including rotation and translation during skeleton movement.…”
Section: Point Levelmentioning
confidence: 99%
“…For example, inspired by the Lie group-based skeleton descriptor [44], Jiang et al [16] proposed a spatiotemporal skeleton transformation descriptor (ST-STD) to define the relative transformations of skeleton gestures, including rotation and translation during skeleton movement. Ahad et al [45] used the linear joint position feature (LJPF) and angular joint position feature (AJPF) obtained based on the three-dimensional linear joint positions and angles between skeleton segments as distinctive features. Nie et al [9] proposed two new viewpointinvariant motion features: the Euler angle of joints (JEAs) and the Euclidean distance matrix between joints (JEDM).…”
Section: Point Levelmentioning
confidence: 99%
“…The number of classes defined varied across 9 to 18, depending on the dataset used. The SVM classifier was trained with a linear kernel function obtaining, for each dataset, the following results in terms of accuracy and precision: 93.91%, 97.51%, 74.78%, 71.58%, and 94.92%, respectively [ 20 ].…”
Section: Introductionmentioning
confidence: 99%
“…The first deep model was composed of two LSTM layers, the second one was arranged with one CNN layer followed by an LSTM network (CNNRNN), and the last model was organized with two CNN networks and an LSTM network for the last layer (ConvRNN). The best model for all the datasets used was the ConvRNN architecture, which obtained accuracies ranging 94.7% and 98.1% [ 20 ]. Zhu et al proposed a new spatial model with end-to-end bidirectional LSTM-CNN (BLSTM-CNN).…”
Section: Introductionmentioning
confidence: 99%