“…To propagate temporal information across frames, one approach suggested in a study (Hossain and Little, 2018) involves employing a sequence-to-sequence model, while another study introduces the use of a spatial-temporal graph. Video input is also utilized to exploit temporal features , Chen et al, 2021a, Kocabas et al, 2020, Ye et al, 2023, Zhao et al, 2021, but in this case, a frame-to-frame real-time operation is not feasible due to the heavy structural system or the necessity of future information. Han et al (Han et al, 2020(Han et al, , 2022) demonstrate an integrated handtracking system within the Oculus Quest VR headset, utilizing the available inputs that depend on the hardware and proposed a regression network using tracking history.…”