2022
DOI: 10.48550/arxiv.2203.16250
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PP-YOLOE: An evolved version of YOLO

Abstract: In this report, we present PP-YOLOE, an industrial state-of-the-art object detector with high performance and friendly deployment. We optimize on the basis of the previous PP-YOLOv2, using anchor-free paradigm, more powerful backbone and neck equipped with CSPRepResStage, ET-head and dynamic label assignment algorithm TAL. We provide s/m/l/x models for different practice scenarios. As a result, PP-YOLOE-l achieves 51.4 mAP on COCO testdev and 78.1 FPS on Tesla V100, yielding a remarkable improvement of (+1.9 A… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
46
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 51 publications
(47 citation statements)
references
References 23 publications
1
46
0
Order By: Relevance
“…The experiment results illustrated in Table 5 fully confirm that the proposed planned re-parameterized model is equally effective on residual-based model. We find that the design of RepCSPResNet [85] also fit our design pattern.…”
Section: Ablation Study 541 Proposed Compound Scaling Methodssupporting
confidence: 61%
See 1 more Smart Citation
“…The experiment results illustrated in Table 5 fully confirm that the proposed planned re-parameterized model is equally effective on residual-based model. We find that the design of RepCSPResNet [85] also fit our design pattern.…”
Section: Ablation Study 541 Proposed Compound Scaling Methodssupporting
confidence: 61%
“…Currently state-of-the-art real-time object detectors are mainly based on YOLO [61,62,63] and FCOS [76,77], which are [3,79,81,21,54,85,23]. Being able to become a state-of-the-art real-time object detector usually requires the following characteristics: (1) a faster and stronger network architecture; (2) a more effective feature integration method [22,97,37,74,59,30,9,45]; (3) a more accurate detection method [76,77,69]; (4) a more robust loss function [96,64,6,56,95,57]; (5) a more efficient label assignment method [99,20,17,82,42]; and (6) a more efficient training method.…”
Section: Related Work 21 Real-time Object Detectorsmentioning
confidence: 99%
“…We test the speed performance of all official models with FP16-precision on the same Tesla T4 GPU with TensorRT [11]. We compare the upgraded YOLOv6 with YOLOv5 [5], YOLOX [3], PPYOLOE [17], YOLOv7 [16] and YOLOv8 [6]. Note that the performance of YOLOv7-Tiny is re-evaluated according to their open-sourced code and weights at the input size of 416 and 640.…”
Section: Comparisonsmentioning
confidence: 99%
“…YOLOv4 [1] reorganized the detection framework into several separate parts (backbone, neck and head), and verified bag-of-freebies and bag-of-specials at the time to design a framework suitable for training on a single GPU. At present, YOLOv5 [5], YOLOX [3], PPY-OLOE [17], YOLOv7 [16] and most recently YOLOv8 [6] are all the competing candidates for efficient detectors to deploy.…”
Section: Introductionmentioning
confidence: 99%
“…The deployment of efficient convolutional neural networks (CNNs) enabled immense progress [4,9,23,[28][29][30]40] in vision detectors for edge devices, in which they consistently reduce parameters and speed counts for improving accuracy. However, these metrics are not correlated well with the efficiency of the models in terms of energy.…”
Section: Introductionmentioning
confidence: 99%