2018
DOI: 10.1007/978-3-030-01261-8_49
|View full text |Cite
|
Sign up to set email alerts
|

Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
75
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
3
2

Relationship

3
7

Authors

Journals

citations
Cited by 132 publications
(75 citation statements)
references
References 26 publications
0
75
0
Order By: Relevance
“…The object tracking problem of RGB-T is an extension of the traditional visual tracking task, that is, given the initial position state of the target, the RGB and thermal infrared image are comprehensively used to continuously estimate the target position in subsequent scenes. In recent years, several works have been carried out on this RGB-T tracking research, and representative approaches are roughly divided into two categories: Tracking based on traditional manual features [1][2][3][4][5][6][7] and tracking based on deep learning [8][9][10][11].The former category is mostly based on theoretical frameworks such as sparse representation [2][3][4][5], correlation filtering [6], Bayesian filtering [7], and uses hand-crafted textures or local features to construct cross-modal object appearance model and state estimation methods. The latter class builds the effective model of modeling targets from massive data by exploring the powerful feature representation capabilities of deep neural networks.…”
Section: Introductionmentioning
confidence: 99%
“…The object tracking problem of RGB-T is an extension of the traditional visual tracking task, that is, given the initial position state of the target, the RGB and thermal infrared image are comprehensively used to continuously estimate the target position in subsequent scenes. In recent years, several works have been carried out on this RGB-T tracking research, and representative approaches are roughly divided into two categories: Tracking based on traditional manual features [1][2][3][4][5][6][7] and tracking based on deep learning [8][9][10][11].The former category is mostly based on theoretical frameworks such as sparse representation [2][3][4][5], correlation filtering [6], Bayesian filtering [7], and uses hand-crafted textures or local features to construct cross-modal object appearance model and state estimation methods. The latter class builds the effective model of modeling targets from massive data by exploring the powerful feature representation capabilities of deep neural networks.…”
Section: Introductionmentioning
confidence: 99%
“…In [3], they proposed a patch-based graph model to learn object feature presentation for RGB-T tracking, where the graph is optimized via weighted sparse representations that utilize multi-modality information adaptively. Li et al [4] provided a graph-based cross-modal ranking model for RGB-T tracking, in which the soft cross-modality consistency between modalities and the optimal query learning are introduced to improve the robustness.…”
Section: B Rgb-t Vision Methodsmentioning
confidence: 99%
“…RGBT tracking receives much attention recently and becomes more and more popular [17,21,22,16,20,23]. Recent works [21,17,16] employ reconstruction residues [21,17] or classification scores [16] to guide the weights learning of modalities to achieve adaptive fusion of RGB and thermal modalities.…”
Section: Rgbt Tracking Methodsmentioning
confidence: 99%