“…Today, with the development of computer hardware and the advent of deep learning, researchers have become equipped with novel tools, the most prominent of which are various convolutional networks (CNN). There have been many published CNN-based researches on hand detection such as YOLOv1 [ 24 ], YOLOv2 [ 25 ], YOLOv3 [ 26 ], YOLOv4 [ 27 ], YOLOv5 [ 14 , 28 ], YOLOv7 [ 9 ], Mask R-CNN [ 29 , 30 ], SSD [ 31 ], MobileNetv3 [ 32 ], etc. Some of the most prominent results are shown in Figure 1 of Wang et al’s work [ 9 ], where YOLOv7 achieved the best results in terms of accuracy and speed.…”