2022
DOI: 10.3389/fmars.2022.1056300
|View full text |Cite
|
Sign up to set email alerts
|

Underwater object detection algorithm based on attention mechanism and cross-stage partial fast spatial pyramidal pooling

Abstract: For the routine target detection algorithm in the underwater complex environment to obtain the image of the existence of blurred images, complex background and other phenomena, leading to difficulties in model feature extraction, target miss detection and other problems. Meanwhile, an improved YOLOv7 model is proposed in order to improve the accuracy and real-time performance of the underwater target detection model. The improved model is based on the single-stage target detection model YOLOv7, incorporating t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(19 citation statements)
references
References 29 publications
0
19
0
Order By: Relevance
“…The selection of the number of integrated CBAM blocks during the incorporation of CBAM modules was done meticulously to minimize any significant increase in the computational overhead on the network. The integration of CBAM modules into the YOLOv7 architecture by researchers to develop novel designs for solving various problems has emerged as a highly popular research area [ 49 , 50 ]. Therefore, the determination of position, number of blocks, and their integration with other network components are design decisions that can vary depending on the specific problem under consideration.…”
Section: Methodsmentioning
confidence: 99%
“…The selection of the number of integrated CBAM blocks during the incorporation of CBAM modules was done meticulously to minimize any significant increase in the computational overhead on the network. The integration of CBAM modules into the YOLOv7 architecture by researchers to develop novel designs for solving various problems has emerged as a highly popular research area [ 49 , 50 ]. Therefore, the determination of position, number of blocks, and their integration with other network components are design decisions that can vary depending on the specific problem under consideration.…”
Section: Methodsmentioning
confidence: 99%
“…That is, the running speed is further improved by fusing the feature maps of various receptive fields, which enriches the expressive power of the feature maps. With the continuous innovation of the YOLO algorithm, the module for the spatial pyramid pooling is further improved into SPPCSPC 27 and SPPFCSPC 28 . The CSP structure is utilized to divide the features into two parts, one of which is processed for regular convolution, and the other is processed for the SPP structure.…”
Section: Methodsmentioning
confidence: 99%
“…A detailed explanation of CBAM, CAM, and SAM can be found in [26]. Recent studies, including [27,28], have demonstrated successful combinations of convolutional block attention modules (CBAM) with convolutional neural networks in computer vision tasks. However, these studies have not provided the source code for the implementation of CBAM modules or the configuration of the network structure, making the reproducibility of reported results unfeasible.…”
Section: Convolutional Block Attention Module (Cbam)mentioning
confidence: 99%