“…Human pose estimation systems are used to extract features in fifteen papers [35, 36, 62, 69, 73, 79, 80, 84, 85, 88-90, 94, 95, 97]. The estimated poses can be the only inputs to the translation model [35, 62, 69, 73, 84, 85, 90, 94, 97], or they can augment other spatial or spatio-temporal features [36, 79, 80, 88, 89, 95]. Often, the keypoints are used as a sign language representation directly.…”
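
The two usage patterns described above, poses as the sole input versus poses augmenting other features, can be illustrated with a minimal sketch. It is not taken from any of the cited papers. It assumes 2D `(x, y)` keypoints per frame, a simple centre-and-scale normalisation, and per-frame concatenation as the way of combining features. All function names (`keypoints_to_vector`, `pose_only_inputs`, `pose_augmented_inputs`) are hypothetical.

```python
from typing import List, Sequence, Tuple

Keypoint = Tuple[float, float]  # (x, y) image coordinates; an assumed format
Frame = Sequence[Keypoint]      # all keypoints estimated for one video frame


def keypoints_to_vector(frame: Frame) -> List[float]:
    """Flatten one non-empty frame of keypoints into a normalised vector.

    Coordinates are centred on the mean keypoint and divided by the largest
    absolute offset, so the result does not depend on where the signer stands
    or how large they appear in the image. This normalisation is an assumption
    made for illustration only.
    """
    cx = sum(x for x, _ in frame) / len(frame)
    cy = sum(y for _, y in frame) / len(frame)
    centred = [(x - cx, y - cy) for x, y in frame]
    # Fall back to 1.0 when every keypoint coincides, to avoid dividing by zero.
    scale = max(max(abs(x), abs(y)) for x, y in centred) or 1.0
    return [c / scale for point in centred for c in point]


def pose_only_inputs(video: Sequence[Frame]) -> List[List[float]]:
    """Poses as the only input: one keypoint vector per frame."""
    return [keypoints_to_vector(frame) for frame in video]


def pose_augmented_inputs(
    video: Sequence[Frame],
    other_features: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Poses augmenting other per-frame features via concatenation."""
    if len(video) != len(other_features):
        raise ValueError("need exactly one feature vector per frame")
    return [
        keypoints_to_vector(frame) + list(extra)
        for frame, extra in zip(video, other_features)
    ]
```

In both cases the output is a sequence of per-frame vectors that a sequence model could consume. The only difference is whether the keypoint vector stands alone or is extended with features from another extractor.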