2021
DOI: 10.1049/ipr2.12195
|View full text |Cite
|
Sign up to set email alerts
|

Jointly human semantic parsing and attribute recognition with feature pyramid structure in EfficientNets

Abstract: Pedestrian attributes recognition is an important issue in computer vision and has a special role in the field of video surveillance. The previous methods presented to solve this issue are mainly based on multi-label end-to-end deep neural networks. These methods neglect to apply attributes for defining local feature areas and they suffer from the problems of the bounding box presence. A new framework for jointly human semantic parsing and pedestrian attribute recognition to achieve effective attribute recogni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…EfficientNet architecture is designed by Tan and Le [16] as a demonstration of their proposed compound scaling method. Scaling CNN model is usually done by increasing depth, width, and resolution which could become tedious and computationally expensive [18]. However, the proposed compound scaling method by Tan and Le [16] uses a compound coefficient to balance the network depth, width, and resolution so that a baseline CNN can be effectively scaled up.…”
Section: Modelingmentioning
confidence: 99%
See 2 more Smart Citations
“…EfficientNet architecture is designed by Tan and Le [16] as a demonstration of their proposed compound scaling method. Scaling CNN model is usually done by increasing depth, width, and resolution which could become tedious and computationally expensive [18]. However, the proposed compound scaling method by Tan and Le [16] uses a compound coefficient to balance the network depth, width, and resolution so that a baseline CNN can be effectively scaled up.…”
Section: Modelingmentioning
confidence: 99%
“…By using the compound scaling method, baseline EfficientNet-B0 is scaled up to obtain EfficientNet-B1 to EfficientNet-B7. All of them are able to perform well with relatively smaller and faster networks [16], [18]. All EfficientNet models achieved high accuracy and efficiency on several widely used datasets, such as ImageNet, CIFAR-100, and Flowers [16].…”
Section: Modelingmentioning
confidence: 99%
See 1 more Smart Citation
“…This model, which uses the EffiCientNet model as the backbone network, has a good performance in image classification by scaling the network width, depth, and input resolution together. As the number of parameters of the model is optimized, it is also advantageous in terms of using system resources [ 60 62 ].…”
Section: Real Environment Application With Hsp-dqnmentioning
confidence: 99%
“…Human parsing has attracted considerable attention in recent years, because it is the core technology that supports many research studies and applications in the fields of retailing [1][2][3], social science [4,5], medicine [6,7], and even security [8,9]. The aim of human parsing is to segment the pixels of an input image into regions according to different labels of body parts and clothes.…”
Section: Introductionmentioning
confidence: 99%