2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00899
|View full text |Cite
|
Sign up to set email alerts
|

Part-Aware Context Network for Human Parsing

Abstract: As a fine-grained segmentation task, human parsing is still faced with two challenges: inter-part indistinction and intra-part inconsistency, due to the ambiguous definitions and confusing relationships between similar human parts. To tackle these two problems, this paper proposes a novel Affinity-aware Compression and Expansion Network (ACENet), which mainly consists of two modules: Local Compression Module (LCM) and Global Expansion Module (GEM). Specifically, LCM compresses parts-correlation information thr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 56 publications
(22 citation statements)
references
References 57 publications
0
22
0
Order By: Relevance
“…Fine-grained human semantic segmentation, as one of the central tasks in human understanding, has applications in human-centric vision [41,66,43], human-robot interaction [11] and fashion analysis [46]. However, previous studies mainly focus on category-level human parsing [32,15,12,35,47,63,64,49]; only very few human parsers are specifically designed for the instance-aware setting. As of to date, there exist two paradigms for instance-aware human parsing: top-down and bottom-up.…”
Section: Related Workmentioning
confidence: 99%
“…Fine-grained human semantic segmentation, as one of the central tasks in human understanding, has applications in human-centric vision [41,66,43], human-robot interaction [11] and fashion analysis [46]. However, previous studies mainly focus on category-level human parsing [32,15,12,35,47,63,64,49]; only very few human parsers are specifically designed for the instance-aware setting. As of to date, there exist two paradigms for instance-aware human parsing: top-down and bottom-up.…”
Section: Related Workmentioning
confidence: 99%
“…Inference: In the inference time, the pixel accuracy (pixAcc), mean accuracy, and mean pixel Intersection-overunion (mIoU) are leveraged as the evaluation metrics for the LIP dataset, pixel accuracy, precision, recall, and F-1 scores for the ATR dataset, and mIoU for the CIHP dataset. Similar to [10,23,32,41], we averaged the results using 3-scale image pyramids of different scales [0.75, 1.0, 1.25] and flipping for further performance improvement.…”
Section: Experimental Settingsmentioning
confidence: 99%
“…Since the human body contains highly structural information, many previous methods enhance the pixel-level representations with well-designed architectures that can capture the global context cues, such as global context embeddings [4], [26], generative adversarial networks [25], [27], and recurrent models [28], [29]. Apart from pixel-level semantics, human part classes naturally have rich structural semantics, hence, many works model the body part correlations explicitly by building, e.g., graph neural networks [30], [31], tree-like topology information passing architectures [32], [33], and hierarchical human structures [34], [35], [36]. Another direction is exploiting common semantics among different human-centric tasks, e.g., pose estimation and keypoint detection [6], [37], [37], [38], [39], [40], [41] or other prior human semantics, e.g., edge information or human contour [30], [42].…”
Section: Human Parsingmentioning
confidence: 99%
“…PPNet. We also compare our method with PPNet [35], which employs part-aware prototypes. Since one human scene contains multiple human classes and we lack the global semantic class information described in PPNet, we have to remove the semantic branch in [35].…”
Section: Contendersmentioning
confidence: 99%