RF-Net: An End-To-End Image Matching Network Based on Receptive Field

Shen, Xuelun; Wang, Cheng; Li, Xin; Yu, Zenglei; Li, Jonathan; Wen, Chenglu; Cheng, Ming; He, Zhongshi

doi:10.1109/cvpr.2019.00832

Cited by 97 publications

(54 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Feature matching networks (Yi et al, 2016, Ono et al, 2018, Christiansen et al, 2019, Shen et al, 2019, Kniaz et al, 2020 seems to outperform handcrafted feature detectors/descriptor methods. Still, their performance is closely related to the similarity of local image patches in the training dataset with respect to the images used during inference.…”

Section: Deep Convolutional Neural Networkmentioning

confidence: 99%

Wire Structure Image-Based 3d Reconstruction Aided by Deep Learning

Kniaz

Zheltov

Remondino

et al. 2020

Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.

View full text Add to dashboard Cite

Abstract. Objects and structures realized by connecting and bending wires are common in modern architecture, furniture design, metal sculpting, etc. The 3D reconstruction of such objects with traditional range- or image-based methods is very difficult and poses challenges due to their unique characteristics such as repeated structures, slim elements, holes, lack of features, self-occlusions, etc. Complete 3D models of such complex structures are normally reconstructed with lots of manual intervention as automated processes fail in providing detailed and accurate 3D reconstruction results.This paper presents the image-based 3D reconstruction of the Shukhov hyperboloid tower in Moscow, a wire structure built in 1922, composed of a series of hyperboloid sections stacked one to another to approximate an overall conical shape. A deep learning approach for image segmentation was developed in order to robustly detect wire structures in images and provide the basis for accurate corresponding problem solutions. The developed WireNet convolution neural network (CNN) model has been used to aid the multi-view stereo (MVS) process and to improve robustness and accuracy of the image-based 3D reconstruction approach, otherwise not feasible without masking the images automatically.

show abstract

Section: Deep Convolutional Neural Networkmentioning

confidence: 99%

Wire Structure Image-Based 3d Reconstruction Aided by Deep Learning

Kniaz

Zheltov

Remondino

et al. 2020

Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.

View full text Add to dashboard Cite

show abstract

“…The experiments were performed using different remote sensing images to evaluate the performance and robustness of the algorithm. Seven remote sensing registration algorithms were used as comparison groups: SIFT [14], SURF [16], FSC-SIFT [20], PSO-SIFT [17], SAR-SIFT [19], PSO-SIFT-CNN [23] and RF-Net [31].…”

Section: F Experimental Setting and Datasetsmentioning

confidence: 99%

“…The fourth approach is based on deep network image matching (registration) not only can extract the deep features of key points, but also can automatically extract key points.Methods that apply this approach include D2-Net [29], Super-Point [30], and RF-Net models [31]. These methods are commonly used for natural image matching, and have good robustness to angle and illumination.…”

Section: Introductionmentioning

confidence: 99%

Registration of Multiresolution Remote Sensing Images Based on L2-Siamese Model

Fan

Hou

Liu

et al. 2021

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

The registration of multi-resolution optical remote sensing images has been widely used in image fusion, change detection, and image stitching. However, traditional registration methods achieve poor accuracy in the registration of multiresolution remote sensing images. In this study, we propose a framework for generating deep features via a deep residual encoder (DRE) fused with shallow features for multi-resolution remote sensing image registration. Through an L2 normalization Siamese network (L2-Siamese) based on the DRE, the multiscale loss function is used to learn the attribute characteristics and distance characteristics of two key points and obtain the trained feature extractor. Finally, the DRE is used to extract the deep features of the key points and their neighbors, which are concatenated with the shallow features into a fusion feature vector to complete the image registration. We performed comprehensive experiments on four sets of multi-resolution optical remote sensing images and two sets of synthetic aperture radar images. The results demonstrate that the proposed registration model can achieve sub-pixel registration. The relative registration accuracy improved by 1.6 − 7.5%, whereas the overall performance improved by 4.5 − 14.1%.

show abstract

“…For example, in the pulmonary lesion or multi-organ segmentation task, the edge detail of the smaller lesion/organ is not fine by the large receptor field and the structure of the lesion/organ is not obvious by the small receptor field. Therefore, it is very important to use the convolution kernel with different receptive fields to process the image (Luo et al, 2016 ; Peng et al, 2017 ; Shen et al, 2019 ). In the natural image processing task, satisfactory results are obtained by combining the convolution of different receptive fields (Seif and Androutsos, 2018 ).…”

Section: Introductionmentioning

confidence: 99%

MSU-Net: Multi-Scale U-Net for 2D Medical Image Segmentation

Zhang

Liu

et al. 2021

Front. Genet.

View full text Add to dashboard Cite

Aiming at the limitation of the convolution kernel with a fixed receptive field and unknown prior to optimal network width in U-Net, multi-scale U-Net (MSU-Net) is proposed by us for medical image segmentation. First, multiple convolution sequence is used to extract more semantic features from the images. Second, the convolution kernel with different receptive fields is used to make features more diverse. The problem of unknown network width is alleviated by efficient integration of convolution kernel with different receptive fields. In addition, the multi-scale block is extended to other variants of the original U-Net to verify its universality. Five different medical image segmentation datasets are used to evaluate MSU-Net. A variety of imaging modalities are included in these datasets, such as electron microscopy, dermoscope, ultrasound, etc. Intersection over Union (IoU) of MSU-Net on each dataset are 0.771, 0.867, 0.708, 0.900, and 0.702, respectively. Experimental results show that MSU-Net achieves the best performance on different datasets. Our implementation is available at https://github.com/CN-zdy/MSU_Net.

show abstract

RF-Net: An End-To-End Image Matching Network Based on Receptive Field

Cited by 97 publications

References 30 publications

Wire Structure Image-Based 3d Reconstruction Aided by Deep Learning

Wire Structure Image-Based 3d Reconstruction Aided by Deep Learning

Registration of Multiresolution Remote Sensing Images Based on L2-Siamese Model

MSU-Net: Multi-Scale U-Net for 2D Medical Image Segmentation

Contact Info

Product

Resources

About