Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Application 2018
DOI: 10.5220/0006622901140122
|View full text |Cite
|
Sign up to set email alerts
|

Pedestrian Attribute Recognition with Part-based CNN and Combined Feature Representations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(19 citation statements)
references
References 0 publications
0
19
0
Order By: Relevance
“…age, gender).To go further, the framework allows an easy addition of new blocks, even for new features, in order to answer new use-cases, or to have better performance on already covered use-cases. A coming block, is the implementation of the work of our colleagues [3] to get 45 more features (e.g. Causal/Formal upper/lower clothes, carrying plastic bag, gender) of people.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…age, gender).To go further, the framework allows an easy addition of new blocks, even for new features, in order to answer new use-cases, or to have better performance on already covered use-cases. A coming block, is the implementation of the work of our colleagues [3] to get 45 more features (e.g. Causal/Formal upper/lower clothes, carrying plastic bag, gender) of people.…”
Section: Resultsmentioning
confidence: 99%
“…In recent works, the Deep Learning approach applied on large dataset has allowed the extraction of numerous human features (e.g. clothes color, gender, hat, hair color) [2,3]. All these approaches give important clues and tools to characterize and track people and could be based on simple sensors such as a 2D camera.…”
Section: Introductionmentioning
confidence: 99%
“…In fact, many attributes just appear in small regions on human body, so using a unique global feature vector to learn all attributes is inefficient. Recognizing this drawback, [28,29] proposed part-based CNN models, in which a global feature map from a middle layer is horizontally split into 4 disjoint equal local feature maps, each one is then forwarded to some other convolutional layers followed by a last linear layer to predict probabilities of a group of attributes. The improvement in this method is that it use multiple local features to predict groups of suitable attributes.…”
Section: Attribute For Re-idmentioning
confidence: 99%
“…However, most attributes appear in local positions, so global features are insufficient to recognize them. Some works notice this drawback and improve by divide global features into local parts [28,29], but they still consider attributes as an auxiliary branch to enrich deep features.…”
Section: Introductionmentioning
confidence: 99%
“…In order to learn the relationship between the mid-level features and the attributes, these detected layers are trained using image labels only by adopting a max-pooling based weakly-supervised object detection technique. Combining CNN extracted features with LOMO features, a part-based network is proposed by [16]. LOMO features, based on HSV histograms and Scale-Invariant Local Ternary Patterns, are texture and color descriptors shown to be illumination-invariant.…”
Section: Related Workmentioning
confidence: 99%