2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2017
DOI: 10.1109/cvprw.2017.204
|View full text |Cite
|
Sign up to set email alerts
|

Self-Supervised Neural Aggregation Networks for Human Parsing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
38
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 73 publications
(39 citation statements)
references
References 19 publications
1
38
0
Order By: Relevance
“…It has 4,371 images in total (3,934 for training, and 437 for testing). Evaluation Metrics: For LIP, following its standard protocol [78], we report pixel accuracy, mean accuracy and mean Intersection-over-Union (mIoU). For PASCAL-Person-Part, following conventions [70,71,50], the performance is evaluated in terms of mIoU.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…It has 4,371 images in total (3,934 for training, and 437 for testing). Evaluation Metrics: For LIP, following its standard protocol [78], we report pixel accuracy, mean accuracy and mean Intersection-over-Union (mIoU). For PASCAL-Person-Part, following conventions [70,71,50], the performance is evaluated in terms of mIoU.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…In addition, global image‐level context is proposed to guarantee the coherence between pixel‐wise labelling and image‐label prediction. The end‐to‐end parsing network could effectively eliminate the error accumulation effect of the linear pipeline [10–14, 35]. Therefore, it achieves some state‐of‐the‐art performance on ATR dataset [10].…”
Section: Related Workmentioning
confidence: 99%
“…The multi‐scale operation is another effective way for the various image segmentation problems. Zhao et al [13] proposed an adaptive neural aggregation to fuse the multi‐scale features and achieved state‐of‐the‐art performance on LIP dataset. He and Yang [14] utilised InceptionNet and atrous spatial pyramid pooling structure to generate multi‐scale features and realised a real‐time parsing by pruning the network.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Similarly, the authors of Reference [35] proposed to integrate parsing with optical flow estimation. The authors of Reference [36] incorporated a self-supervised joint loss to ensure the consistency between parsing and pose. However, the guidance from poses cannot improve borders.…”
Section: Related Workmentioning
confidence: 99%