“…Single-person pose estimation in videos has also been studied extensively in the literature [28,9,46,33,20,44,29,13,18]. These approaches mainly aim to improve pose estimation by utilizing temporal smoothing constraints [28,9,44,33,13] and/or optical flow information [46,20,29], but they are not directly applicable to videos with multiple, potentially occluding persons.…”