2020
DOI: 10.1002/rob.21961
|View full text |Cite
|
Sign up to set email alerts
|

Learning features from georeferenced seafloor imagery with location guided autoencoders

Abstract: Although modern machine learning has the potential to greatly speed up the interpretation of imagery, the varied nature of the seabed and limited availability of expert annotations form barriers to its widespread use in seafloor mapping applications. This motivates research into unsupervised methods that function without large databases of human annotations. This paper develops an unsupervised feature learning method for georeferenced seafloor visual imagery that considers patterns both within the footprint of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
43
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 32 publications
(45 citation statements)
references
References 37 publications
2
43
0
Order By: Relevance
“…In our implementation, AlexNet [7] and its inverted architecture are used as the encoder and decoder, respectively, where any type of neural network can be used to construct autoencoder in a similar way. Our previous LGA method [19] can be regarded as a specific case of eq. ( 1), where only L loc and λ loc are used.…”
Section: B Implementation For Georeferenced Imagerymentioning
confidence: 99%
See 3 more Smart Citations
“…In our implementation, AlexNet [7] and its inverted architecture are used as the encoder and decoder, respectively, where any type of neural network can be used to construct autoencoder in a similar way. Our previous LGA method [19] can be regarded as a specific case of eq. ( 1), where only L loc and λ loc are used.…”
Section: B Implementation For Georeferenced Imagerymentioning
confidence: 99%
“…To investigate the effectiveness of the proposed regularisation, the autoencoder is trained (i) without regularisation, (ii) with L loc , (iii) with L dep , (iv) with both L loc and L dep on all 32,097 images in the dataset. AlexNet [7] with batch normalisation is used as the encoder architecture, and its inverse is used as the decoder where the number of dimensions of the encoder output (equal to the number of dimensions of the decoder input) is set to 16 in accordance with our previous work [19]. The autoencoder weights are initialised with the values of AlexNet pre-trained on ImageNet.…”
Section: B Autoencoder Trainingmentioning
confidence: 99%
See 2 more Smart Citations
“…Walker et al [31] use physics based color correction and scale normalization on underwater images to reduce the generalization error of a DeepLabV3+ model [32] for image segmentation. Similarly, Yamada et al [33] use color correction and image rescaling to enhance their method for unsupervised feature learning of georeferenced sea floor images. All methods are applied to a single dataset and are not used for knowledge transfer to enable cross-dataset machine learning.…”
Section: Introductionmentioning
confidence: 99%