Human pose and motion detection is an important area of computer vision research, covering the interplay of different fields such as image processing, pattern recognition, and artificial intelligence. Due to the complexity of human motion, existing 3D recognition and pose detection methods based on low-quality depth images are not very accurate and reliable. Due to the time-sensitive nature of features, a single feature cannot adapt to the dynamic changes of the scene, so it is difficult for the target tracking algorithm based on a single feature to achieve robust tracking results. If multiple features are fused and applied in the tracking algorithm, the complementarity between different features can be used to better adapt to the scene changes and achieve robust tracking results. In order to solve the problem of human pose and human motion recognition in low-quality depth images, this paper uses the Kinect somatosensory camera to obtain 20 human skeleton joint points through the Kinect skeleton tracking technology. This paper studies the typical human posture and interactive action recognition technology in daily life. On the basis of understanding the characteristics of skeletal data, this paper proposes a distance feature and angle feature model combined with human body structure. Through the experimental results, it is found that the distance feature and the angle feature value are basically not affected by the distance change. When the subjects turned 45° to the left and 45° to the right, the distance characteristics changed, which was different from the characteristic data when the subjects turned around.