“…Recent advances in artificial intelligence technology and vision sensors have promoted vision-based action recognition for various applications, such as education [ 2 ], entertainment [ 3 , 4 ], and sports [ 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 ]. Various studies have proposed novel algorithms [ 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 ] or established datasets [ 1 , 24 , 25 , 26 , 27 ] for vision-based action recognition.…”