2021
DOI: 10.3390/s21051820
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Abstract: The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which im… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…Zhang [11] designed the OR-CNN network and utilized the PORol pooling unit to partition the human body into five parts for feature fusion. Shao [12] inspired by FPN, proposed the multi-scale feature pyramid network (MFPN) based on ResNet [13] . It fuses features from different scales to enhance information extraction ability for occluded targets.…”
Section: Introductionmentioning
confidence: 99%
“…Zhang [11] designed the OR-CNN network and utilized the PORol pooling unit to partition the human body into five parts for feature fusion. Shao [12] inspired by FPN, proposed the multi-scale feature pyramid network (MFPN) based on ResNet [13] . It fuses features from different scales to enhance information extraction ability for occluded targets.…”
Section: Introductionmentioning
confidence: 99%
“…This approach overcomes the limitation of single proposal boxes predicting a single target and introduces the CrowdDet network model, capable of predicting multiple targets from a single proposal box. Building on this, Shao et al [11] used a residual network (ResNet) as a base and enhanced the detection rate for targets with incomplete feature information by integrating multi-scale feature pyramids for feature fusion. To further improve the detection of occluded objects, Yang et al [12] proposed a combined Rep-GIoU loss by blending repulsion loss with GIoU loss.…”
Section: Introductionmentioning
confidence: 99%
“…Shao et al [12] combined FPN and ResNet to extract and fuse image features, which enhanced the semantic information and contour of occluded pedestrians and improved the performance of pedestrian detection. Pereira et al [13] compared the impact of different data-association measures on the tracking effect of the DeepSORT algorithm; it can be seen that better data-association measures can improve the tracking effect of tracking algorithms.…”
Section: Introductionmentioning
confidence: 99%