2022
DOI: 10.1109/tip.2021.3128330
|View full text |Cite
|
Sign up to set email alerts
|

Seeing Like a Human: Asynchronous Learning With Dynamic Progressive Refinement for Person Re-Identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(6 citation statements)
references
References 69 publications
0
6
0
Order By: Relevance
“…The progressive refinement network has been explored in many supervised tasks, such as supervised image matting [64], person re-identification [65], and temporal action detection [66], motivated by the thought of progressive learning [67], [68]. For example, PBRNet [66] is equipped with three cascaded detection modules for progressive localizing action boundaries more and more precisely.…”
Section: B Progressive Refinementmentioning
confidence: 99%
“…The progressive refinement network has been explored in many supervised tasks, such as supervised image matting [64], person re-identification [65], and temporal action detection [66], motivated by the thought of progressive learning [67], [68]. For example, PBRNet [66] is equipped with three cascaded detection modules for progressive localizing action boundaries more and more precisely.…”
Section: B Progressive Refinementmentioning
confidence: 99%
“…Thus, enriching the input features is necessary for overcoming the defects of the widely used single-scale spatial patch (SP) [30,[32][33][34][35][37][38][39][40] and reducing the learning difficulty. This motivates us to propose a novel shallow feature by taking inspiration from the human visual perception mechanism that processes local and global information in different functional areas of the brain [49,50].…”
Section: Shallow Feature Extractionmentioning
confidence: 99%
“…Due to the existence of diversified characteristics in real changed regions, the singlescale image patch feature, as shown in Figure 3a, which is commonly utilized as the basic processing unit in deep learning-based CD approaches, fails to precisely and flexibly describe diverse characteristics of changed regions and differentiate pseudochanges. Motivated by seeking a new processing unit that can overcome the disadvantages of the single-scale patch, the ESP feature is constructed to better correlate with the human perception mechanism [49,50]. As shown in Figure 3b, the generation of the ESP feature is accomplished by carrying out the following steps.…”
Section: Esp Generationmentioning
confidence: 99%
“…The red box and the red bidirectional arrow indicates to learn the interaction using self-attention and the cross-attention, respectively. development of deep learning, deep feature learning methods [2], [11]- [14] have significantly improved the performance of the person ReID task. However, most of them usually adopt the Convolutional Neural Networks (CNNs) as a backbone to extract deep features, where the long-range dependencies are neglected.…”
Section: Introductionmentioning
confidence: 99%