2022
DOI: 10.3390/s22124416
|View full text |Cite
|
Sign up to set email alerts
|

An Unsupervised Transfer Learning Framework for Visible-Thermal Pedestrian Detection

Abstract: Dual cameras with visible-thermal multispectral pairs provide both visual and thermal appearance, thereby enabling detecting pedestrians around the clock in various conditions and applications, including autonomous driving and intelligent transportation systems. However, due to the greatly varying real-world scenarios, the performance of a detector trained on a source dataset might change dramatically when evaluated on another dataset. A large amount of training data is often necessary to guarantee the detecti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 57 publications
0
4
0
Order By: Relevance
“…In conventional supervised learning, a model is trained on labeled instances from the source domain and directly applied to a target domain with a different data distribution, leading to low performance due to domain shift. However, in UTL, the model learns domain-invariant representations, which are features that remain insensitive to domain-specific variations, enabling the model to generalize well to the target domain even without seeing labeled data from that domain during training [17], [18]. UTL finds various applications in real-world scenarios where obtaining labeled instances in the target domain is limited or expensive.…”
Section: Unsupervised Transfer Learning (Utl)mentioning
confidence: 99%
“…In conventional supervised learning, a model is trained on labeled instances from the source domain and directly applied to a target domain with a different data distribution, leading to low performance due to domain shift. However, in UTL, the model learns domain-invariant representations, which are features that remain insensitive to domain-specific variations, enabling the model to generalize well to the target domain even without seeing labeled data from that domain during training [17], [18]. UTL finds various applications in real-world scenarios where obtaining labeled instances in the target domain is limited or expensive.…”
Section: Unsupervised Transfer Learning (Utl)mentioning
confidence: 99%
“…Base part loss.Base parts are defined as areas with similar pixel intensities in infrared and visible images, so the infrared base parts should enjoy high similarity to the visible base parts. The base part loss is computed as (1). We exploit a tanh function to limit the base part loss to the interval of (−1, 1), which can avoid gradient exploding problems in training.…”
Section: Encoder Lossmentioning
confidence: 99%
“…Infrared images captured by infrared sensors to record the thermal radiations emitted by different objects are widely used in object detection and tracking [ 1 , 2 ]. They are robust to the influence of illumination variation and disguises such as objects in smoke.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation