“…In some cases, top-1, top-5, and top-10 accuracy were calculated, expressing the model's ability to identify 'most likely' candidates rather than one correct answer. A BLUE score was used to assess the quantitative output of translation models with values between 0 and 100 as depicted in Table 15, while qualitative analysis was based on comparison with ground RGB video [185] Kinect [189] Video [157] RGB image extracted from video [191] Video, Kinect [190] Video [187] RGB video, depth video, 3D skeletal data, facial features [41] RGB video, Kinect, 3D skeletal data [195] Kinect, RGB image, skeletal data [50] RGB video [49] RGB, Kinect, Skeleton point data [128] Infrared [133] RGB [66] RGB [3] RGB [37] RGB Video [49] RGB, depth, skeleton [193] Video [68] NA [130] RGB, Kinect [69] RGB Video [70] RGB Video [47] RGB Video [67] RGB, Kinect [158] RGB from two angles, Video RGB video [185] Kinect [189] Video [157] RGB image extracted from video [191] Video, Kinect [190] Video [187] RGB video, depth video, 3D skeletal data, facial features [41] RGB video, Kinect, 3D skeletal data [195] Kinect, RGB image, skeletal data [50] RGB video [49] RGB, Kinect, Skeleton point data [128] Infrared [133] RGB [66] RGB [3] RGB …”