“…Over the last decade, there has been a proliferation of deep learning (DL) approaches for feature extraction and matching (Chen et al, 2021;Jin et al 2021;Yao et al, 2021) that aim to overcome these limitations and they have demonstrated resilience against varying illumination conditions, multitemporal datasets, wide baselines, and significantly different view angles. Recently, several works have proved the effectiveness of DL approaches in challenging scenarios, including glacier monitoring with wide camera baselines (Ioli et al, 2023a, Ioli et al, 2023b, multi-temporal image matching (Maiwald et al, 2023), multi-temporal co-registration problems (Maiwald et al, 2021;, VO and SLAM (Morelli et al, 2023), aerial triangulation (Remondino et al, 2022) and in terrestrial laser scanning point cloud registration (Markiewicz et al, 2023). However, well known limitations of DL approaches are their computational complexity, limited scale and rotation invariance of the descriptors and their application on high-resolution images.…”