2020
DOI: 10.3390/rs12030465
|View full text |Cite
|
Sign up to set email alerts
|

A Two-Stream Symmetric Network with Bidirectional Ensemble for Aerial Image Matching

Abstract: In this paper, we propose a novel method to precisely match two aerial images that were obtained in different environments via a two-stream deep network. By internally augmenting the target image, the network considers the two-stream with the three input images and reflects the additional augmented pair in the training. As a result, the training process of the deep network is regularized and the network becomes robust for the variance of aerial images. Furthermore, we introduce an ensemble method that is based… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 34 publications
(34 citation statements)
references
References 59 publications
0
34
0
Order By: Relevance
“…Feature maps can be regarded as d-dimensional dense local feature descriptors from the input images. We adopted the correlation layer used in the regression task in [45] as feature correlation here, as the way it computes similarities in the regression can be extended to the matching task. Feature correlation outputs correlation maps C AB ∈ R (h×w)×h×w made up of scalar products of feature descriptors v ∈ R d at each position in a pair of feature maps.…”
Section: Architecture Of Scorecnnmentioning
confidence: 99%
See 3 more Smart Citations
“…Feature maps can be regarded as d-dimensional dense local feature descriptors from the input images. We adopted the correlation layer used in the regression task in [45] as feature correlation here, as the way it computes similarities in the regression can be extended to the matching task. Feature correlation outputs correlation maps C AB ∈ R (h×w)×h×w made up of scalar products of feature descriptors v ∈ R d at each position in a pair of feature maps.…”
Section: Architecture Of Scorecnnmentioning
confidence: 99%
“…SE-ResNeXt, which applies the Squeeze-and-Excitation (SE) module to ResNeXt, focuses on more important features. We embed the first three layers of the backbone and apply L2-normalization [45]. Given the coordinates (x s , y s ) and (x t , y t ) corresponding to the sub-image pair (I s , I t ) at the upper left corner or center of the original images, we can obtain the 2-D vectors v A and v B by position embedding, as follows:…”
Section: Architecture Of Etpmentioning
confidence: 99%
See 2 more Smart Citations
“…Experimental results show that this method has a high accuracy. Vakalopoulou et al [122] proposed a CNN-based network for estimating the rigid and deformable parameters between high-resolution satellite image pairs, and then used a 2D space converter layer to align the source image with the target image. Park et al [123] proposed an end-to-end scalable network with a two-stream architecture and a bidirectional training architecture for estimating the transformation parameters for aerial image pairs.…”
Section: Usingmentioning
confidence: 99%