“…Single-image inpainting methods [Bertalmio et al. 2000; Criminisi et al. 2004] are not designed to treat multi-view datasets, even when they handle effects such as perspective [Huang et al. 2014]. Recent work provides initial solutions to this problem [Baek et al. 2016; Thonat et al. 2016], but suffers from four limitations: 1) multi-view coherence is applied progressively across neighboring images and is often inaccurate or incomplete, 2) perspective effects are not correctly reproduced during inpainting, resulting in visual artifacts and blurring, 3) the quality of depth synthesis is insufficient, and 4) the methods are not designed to handle large datasets, since they often use expensive algorithmic solutions operating on all images in the dataset. We target scenes containing man-made structures, corresponding to city blocks or apartments, captured with up to hundreds of input images.…”