2019
DOI: 10.1109/access.2019.2951443
|View full text |Cite
|
Sign up to set email alerts
|

Sonar Image Detection Based on Multi-Scale Multi-Column Convolution Neural Networks

Abstract: Automatic detection of underwater objects by sonar images is an important and challenging topic in applications of Autonomous Underwater Vehicle (AUV) under the complex marine environment. A detection method is proposed based on Multi-Scale Multi-Column Convolution Neural Networks (MSMC-CNNs). Firstly, the Multi-Scale Multi-Column CNNs is used to form an encoder network for extracting multi-scale features of the sonar image. Secondly, the bicubic linear interpolation algorithm is used as the deconvolution proc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 36 publications
0
6
0
Order By: Relevance
“…The traditional deep learning based image segmentation and detection models such as FCN [8], SegNet, U-Net and PSP-Net apply bilinear interpolation to recover image resolution during deconvolution [26]. Multi-scale multi-column CNN (MSMC-CNN) adopts the bicubic convolution as a deconvolution for the decoder structure to restore the original image size [27]. The results validate that MSMC-CNN is superior to the other segmentation methods.…”
Section: Smallmentioning
confidence: 91%
See 1 more Smart Citation
“…The traditional deep learning based image segmentation and detection models such as FCN [8], SegNet, U-Net and PSP-Net apply bilinear interpolation to recover image resolution during deconvolution [26]. Multi-scale multi-column CNN (MSMC-CNN) adopts the bicubic convolution as a deconvolution for the decoder structure to restore the original image size [27]. The results validate that MSMC-CNN is superior to the other segmentation methods.…”
Section: Smallmentioning
confidence: 91%
“…The number of the aircraft RSIs of EORSSD is relatively small for training MSCNNA. To increase the image diversity, decrease the overfitting problem, and improve the aircraft detection ability, each original aircraft RSI is augmented by a rotation method (90 0 , 180 0 , 270 0 ) and a randomly cropping method with an overlap of 100 pixels, and then data cleaning is adopted to reduce sub-images without objects [27]. Each image is augmented to 4 images including aircraft targets with 3 orientations and a cropping, and then the augmented dataset contains 1548 images in total for aircraft detection task.…”
Section: Data and Preprocessingmentioning
confidence: 99%
“…e basic network used by YOLOv3 is Darknet53 [11]. To avoid the gradient explosion caused by the deepening of network layers, Darknet53 adds RESNET (residual neural network) [21][22][23][24][25] residual structure. When classifying the target objects, YOLOv3 uses several independent logistic regression classifiers.…”
Section: Yolov3mentioning
confidence: 99%
“…The result represented that if the data was limited, the performance of transfer learning was better than that of the training convolutional neural networks (CNNs) from scratch. In [ 10 ], a CNN has been proven effective in extracting features from an image. A multi-scale and multi-column CNN was used to detect sonar images.…”
Section: Introductionmentioning
confidence: 99%