“…The first group is a group of the methods most related to ours, including five Lie group representation-based algorithms: Lie group using DTW [4] (Lie group-DTW), Lie group with TSRVF [45] (Lie group-TSRVF), Lie group with TSRVF and using PCA for dimensionality reduction [44] (Lie group-TSRVF-PCA), Lie group with TSRVF and K-SVD for sparse coding [59] (Lie group-TSRVF-KSVD), and the Lie group with deep learning (LieNet) [51], as well as two TSRVFrelated methods, the body part features with SRV and knearest neighbors clustering [47] (SRV-KNN), and TSRVF on Kendall's shape [5] (Kendall-TSRVF). In addition, two recent manifold-based methods, namely the Kendall-SCDL [49] and Gramian matrices [54], are compared. The methods in the second group are based on classic feature representations, like HOJ3D [12], EigenJoints [13], actionlet ensemble (Actionlet) [17], HON4D [15], discriminative key-frames (Key-frames) [14], RVV with DTW (RVV-DTW) [10], and spatio-temporal naive Bayes nearest-neighbor (ST-NBNN) [25].…”