2023
DOI: 10.3390/s23041801
|View full text |Cite
|
Sign up to set email alerts
|

YOLOv7-RAR for Urban Vehicle Detection

Abstract: Aiming at the problems of high missed detection rates of the YOLOv7 algorithm for vehicle detection on urban roads, weak perception of small targets in perspective, and insufficient feature extraction, the YOLOv7-RAR recognition algorithm is proposed. The algorithm is improved from the following three directions based on YOLOv7. Firstly, in view of the insufficient nonlinear feature fusion of the original backbone network, the Res3Unit structure is used to reconstruct the backbone network of YOLOv7 to improve … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
32
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 76 publications
(32 citation statements)
references
References 46 publications
0
32
0
Order By: Relevance
“…Although the YOLOv7-tiny algorithm is widely used in target detection tasks, the YOLOv7-tiny algorithm has a weak ability to extract small target features and cannot complete traffic target detection tasks effectively 20 . In traffic images, relatively high-level features contain rich semantic information, but due to multi-level convolutional operation, the position information of small targets is severely lost, whereas low-level features contain rich position information and detailed information of small targets 21 . To extract the small target semantic feature information in the feature map, we proposed the new feature enhancement module named enhance feature fusion module (EFFM) (as shown in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…Although the YOLOv7-tiny algorithm is widely used in target detection tasks, the YOLOv7-tiny algorithm has a weak ability to extract small target features and cannot complete traffic target detection tasks effectively 20 . In traffic images, relatively high-level features contain rich semantic information, but due to multi-level convolutional operation, the position information of small targets is severely lost, whereas low-level features contain rich position information and detailed information of small targets 21 . To extract the small target semantic feature information in the feature map, we proposed the new feature enhancement module named enhance feature fusion module (EFFM) (as shown in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…The backbone comprises a convolutional structure and an aggregation network ELAN. Extracted features are then directed to the global feature enhancement ST module and the SPPCSPC [28] module. The PAFPN [29] structure is employed to extract input information of varying sizes for fusion.…”
Section: The Proposed Theorymentioning
confidence: 99%
“…As a result, they necessitate extensive hardware resources for both training and inferencing. The You Only Look Once (YOLO) algorithm series [14][15][16][17][18][19][20] are one-stage detection methodologies, requiring a singular feature extraction phase for complete target detection, and are lauded for their superior efficiency. Wang et al [14] proposed the YOLOv7 model, which integrates an efficient long-range attention network, and a cascade scaling approach.…”
Section: Introductionmentioning
confidence: 99%
“…However, its intrinsic limitation is the large downsampling multiplier, reducing its capability to discern deep features in tiny targets, which predisposes it to potential detection lapses. For this purpose, Zhang et al [16] harnessed the Acmix attention mechanism to enhance the feature extraction capability of the YOLOv7 model, particularly for minute targets. However, the Acmix attention mechanism makes flexible allocation determination difficult.…”
Section: Introductionmentioning
confidence: 99%