2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00895
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Human Parsing With Typed Part-Relation Reasoning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
56
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 110 publications
(57 citation statements)
references
References 46 publications
1
56
0
Order By: Relevance
“…Differentiable attention mechanisms enable a neural network to focus more on relevant elements of the input than on irrelevant parts. With their popularity in the field of natural language processing [8,39,43,49,60], attention modeling is rapidly adopted in various computer vision tasks, such as image recognition [14,23,58,66,73], domain adaptation [67,83], human pose estimation [9,63,77], object detection [4] and image generation [76,81,86]. Further, co-attention mechanisms become an essential tool in many vision-language applications and sequential modeling tasks, such as visual question answering [41,44,75,78], visual dialog [74,84], vision-language navigation [68], and video segmentation [42,61], showing its effectiveness in capturing the underlying relations between different entities.…”
Section: Related Workmentioning
confidence: 99%
“…Differentiable attention mechanisms enable a neural network to focus more on relevant elements of the input than on irrelevant parts. With their popularity in the field of natural language processing [8,39,43,49,60], attention modeling is rapidly adopted in various computer vision tasks, such as image recognition [14,23,58,66,73], domain adaptation [67,83], human pose estimation [9,63,77], object detection [4] and image generation [76,81,86]. Further, co-attention mechanisms become an essential tool in many vision-language applications and sequential modeling tasks, such as visual question answering [41,44,75,78], visual dialog [74,84], vision-language navigation [68], and video segmentation [42,61], showing its effectiveness in capturing the underlying relations between different entities.…”
Section: Related Workmentioning
confidence: 99%
“…Fine-grained human semantic segmentation, as one of the central tasks in human understanding, has applications in human-centric vision [41,66,43], human-robot interaction [11] and fashion analysis [46]. However, previous studies mainly focus on category-level human parsing [32,15,12,35,47,63,64,49]; only very few human parsers are specifically designed for the instance-aware setting. As of to date, there exist two paradigms for instance-aware human parsing: top-down and bottom-up.…”
Section: Related Workmentioning
confidence: 99%
“…Other studies have combined additional human prior information for human parsing. For instance, Wang et al [32] assembled the compositional hierarchy of human bodies for efficient and complete human parsing, and Ji et al [19] exploited the intrinsic physiological structure of the human body by designing a novel semantic neural tree for human parsing. Utilizing grammar rules in a cascaded and parallel manner, Zhang et al [42] employed the inherent hierarchical structure of the human body and the relationship of different human parts to achieve impressive human parsing results.…”
Section: Related Workmentioning
confidence: 99%
“…Ji et al [19] designed a novel semantic neural tree to encode the physiological structure of the human body and achieved competitive results. Wang et al [31,32] exploited deep graph networks and hierarchical human structures to capture the relation information of human parts and obtained better performances. These mechanisms involve designing a complex semantic tree or message-passing network that leads to heavy computing complexity while improving performance.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation