“…However, these advancements in architecture types have not addressed the issue of learning viewpoint-agnostic representations. Viewpoint-agnostic representation learning is drawing increasing attention in the vision community due to its wide range of downstream applications, such as 3D object detection [41], video alignment [6,16,17], action recognition [47,48], pose estimation [22,50], robot learning [24,26,43,45,49], and other tasks.…”