“…This empirically highlights why pose estimation is a great summary of such video data. Which keypoints should be extracted, of course, dramatically depends on the model organism and the goal of the study (e.g., many are required for dense, 3D models) (G€ uler et al, 2018;Sanakoyeu et al, 2020;Zuffi et al, 2016), whereas a single point can suffice for analyzing some behaviors . One of the great advantages of deep learning-based methods is that they are very flexible, and the user can define what should be tracked.…”