2023
DOI: 10.3390/make5040083
|View full text |Cite
|
Sign up to set email alerts
|

A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS

Juan Terven,
Diana-Margarita Córdova-Esparza,
Julio-Alejandro Romero-González

Abstract: YOLO has become a central real-time object detection system for robotics, driverless cars, and video monitoring applications. We present a comprehensive analysis of YOLO’s evolution, examining the innovations and contributions in each iteration from the original YOLO up to YOLOv8, YOLO-NAS, and YOLO with transformers. We start by describing the standard metrics and postprocessing; then, we discuss the major changes in network architecture and training tricks for each model. Finally, we summarize the essential … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
169
0
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 649 publications
(170 citation statements)
references
References 113 publications
0
169
0
1
Order By: Relevance
“…2a). 12,13 The detection model output a bounding box for the bAVM. In the second stage, an image masked by the bounding box was input into the U-Net and U-Net++ architecture, and a segmentation mask was outputted (Fig.…”
Section: Data Preprocessing and Deep Learning Approachesmentioning
confidence: 99%
See 4 more Smart Citations
“…2a). 12,13 The detection model output a bounding box for the bAVM. In the second stage, an image masked by the bounding box was input into the U-Net and U-Net++ architecture, and a segmentation mask was outputted (Fig.…”
Section: Data Preprocessing and Deep Learning Approachesmentioning
confidence: 99%
“…YOLOv5x and YOLOv8x, which had the best performance but a relatively long inference time, were used. 12,13 The models with pretraining and image augmentation methods were compared to the model without those methods. Additionally, two types of pretrained weights were tested: the weight training by the COCO (Common Objects In Context) dataset provided by YOLOv5 and YOLOv8 originally and the weight training by the MRI images with YOLOv5 to detect brain tumors (Fig.…”
Section: Yolov5 and Yolov8 Configurationmentioning
confidence: 99%
See 3 more Smart Citations