“…Informative feature extractions mainly rely on the support vector machine [27,33], K-means clustering algorithm [33,34], or linear discriminant analysis [35], whereas the hidden Markov model was commonly used for human activity recognition [22,31,33,34]. To further increase the accuracy of posture recognition in both industry and academia, the image and inertial sensor fusion is a popular technique, performed by commercial equipment, the Microsoft Kinect [1,10,11,23]. Based on the proposed experimental schemes, the approaches could be categorized as the skeleton-joint-based approach [1,17,24,[28][29][30][31][32] and the silhouette-based approach [23,36].…”