2021
DOI: 10.48550/arxiv.2102.13392
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unifying Remote Sensing Image Retrieval and Classification with Robust Fine-tuning

Abstract: Advances in high resolution remote sensing image analysis are currently hampered by the difficulty of gathering enough annotated data for training deep learning methods, giving rise to a variety of small datasets and associated dataset-specific methods. Moreover, typical tasks such as classification and retrieval lack a systematic evaluation on standard benchmarks and training datasets, which make it hard to identify durable and generalizable scientific contributions. We aim at unifying remote sensing image re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 33 publications
0
1
0
Order By: Relevance
“…Table III shows the performance of GeM on ALEGORIA depending on the training dataset (with our implementation and training protocol). We pick three datasets corresponding to three types of situations encountered in ALEGORIA: GoogleLandmarks [9] (1.4M images, 81k classes, landmark images taken mostly from the ground), SF300 [12] (308k images, 27k classes, aerial vertical and oblique photography) and University1652 [13] (50k images, 701 classes, landmark images in a cross-view setup). We additionaly conduct experiments with Imagenet and SfM-120k (120k images, 55 classes) for reference.…”
Section: B Fine-tuningmentioning
confidence: 99%
“…Table III shows the performance of GeM on ALEGORIA depending on the training dataset (with our implementation and training protocol). We pick three datasets corresponding to three types of situations encountered in ALEGORIA: GoogleLandmarks [9] (1.4M images, 81k classes, landmark images taken mostly from the ground), SF300 [12] (308k images, 27k classes, aerial vertical and oblique photography) and University1652 [13] (50k images, 701 classes, landmark images in a cross-view setup). We additionaly conduct experiments with Imagenet and SfM-120k (120k images, 55 classes) for reference.…”
Section: B Fine-tuningmentioning
confidence: 99%