“…Data augmentation is a core component of deep learning pipelines [41] that improves model robustness by applying transformations to the training data consistent with the data distribution in order to introduce desired equivariant properties. In computer vision and depth estimation in particular, standard data augmentation techniques are usually constrained to the 2D space and include color jittering, flipping, rotation, cropping, and resizing [12,58]. Recent works have started looking into 3D augmentations [38] to improve robustness to errors in scene geometry in terms of camera localization (i.e., extrinsics) and parameters (i.e., intrinsics).…”