2021
DOI: 10.1007/s10846-021-01499-8
|View full text |Cite
|
Sign up to set email alerts
|

Unmanned Surface Vessel Visual Object Detection Under All-Weather Conditions with Optimized Feature Fusion Network in YOLOv4

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 27 publications
(7 citation statements)
references
References 33 publications
0
7
0
Order By: Relevance
“…The results show that their algorithm can achieve automatic analysis and statistical data extraction in waterways videos. Sun et al (2021) optimized the backbone network CSPDarkNet of YOLOv4 for application in an auxiliary intelligent ship navigation system. They added a receptive field block module, and the FPN of YOLOv4 was improved by combining the Transformer mechanism.…”
Section: Ship Detection Based On Yolomentioning
confidence: 99%
“…The results show that their algorithm can achieve automatic analysis and statistical data extraction in waterways videos. Sun et al (2021) optimized the backbone network CSPDarkNet of YOLOv4 for application in an auxiliary intelligent ship navigation system. They added a receptive field block module, and the FPN of YOLOv4 was improved by combining the Transformer mechanism.…”
Section: Ship Detection Based On Yolomentioning
confidence: 99%
“…Next, variants of Resnet-50, VGG16, ResNext50, or Darknet52, can be the backbone, which refers to the network that takes the image as input and retrieves the feature map. The neck and head are backbone sub-sets that maximize the discriminability and robustness of the function using FPN, PAN, RFB, etc., and the forecast-managing head [23][24][25]. For a single-stage detector such as YOLO and SSD, this may either be used for dense prediction or FRCNN and Mask RCNN, a twostage detector also known as Sparse Prediction.…”
Section: Yolov4 For Detectionmentioning
confidence: 99%
“…Take the 13 × 13 grid as an example, which is equal to dividing the input Mel spectrogram into 13 × 13 squares; then, each square will be preset with three prior frames. The classification results of the network will adjust the positions of these three prior boxes and finally filter by the nonmaximum suppression (NMS) algorithm [ 28 ], so as to get the final classification results.…”
Section: Spp-yolo-v4 Network Structurementioning
confidence: 99%