2021
DOI: 10.48550/arxiv.2109.14379
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

Abstract: The infrared small-dim target detection is one of the key techniques in the infrared search and tracking system. Since the local regions similar to infrared small-dim targets spread over the whole background, exploring the interaction information amongst image features in large-range dependencies to mine the difference between the target and background is crucial for robust detection. However, existing deep learningbased methods are limited by the locality of convolutional neural networks, which impairs the ab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(12 citation statements)
references
References 53 publications
0
12
0
Order By: Relevance
“…We select some traditinal methods: Top-Hat [37], LCM [3], WLDM [21], NARM [38], PSTNN [39], IPI [14], RIPT [8], NIPPS [9] and several open source deep learning SOTA methods including MDvsFA [], ACM [10], ALC [11], AGCP [41] and Transformer [19] which for comparison. The results are shown in Table 2, deep learning methods basically perform better than traditional ones due to their great power of feature extraction and generalization.…”
Section: Quantitative Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We select some traditinal methods: Top-Hat [37], LCM [3], WLDM [21], NARM [38], PSTNN [39], IPI [14], RIPT [8], NIPPS [9] and several open source deep learning SOTA methods including MDvsFA [], ACM [10], ALC [11], AGCP [41] and Transformer [19] which for comparison. The results are shown in Table 2, deep learning methods basically perform better than traditional ones due to their great power of feature extraction and generalization.…”
Section: Quantitative Resultsmentioning
confidence: 99%
“…Dai et al [10] proposed an asymmetric contextual modulation to help network performance well and introduced the first public ISOS dataset SIRST in real scenes, Dai et al [11] further applied a handcraft dilated local contrast measure into network. Liu et al [19] firstly introduced multi-head self-attention into ISOS tasks and got a good result. Zhang et al [41] proposed AGPCNet with attentionguided context block and context pyramid module.…”
Section: Isosmentioning
confidence: 99%
See 1 more Smart Citation
“…MAResU-Net [42] add the self-attention module to CNN for remote sensing image segmentation. After obtaining image features from CNN, Liu et al adopt the self-attention mechanism to learn the interaction information of image features in a larger range [43]. Unlike it, our network extracts features by a pure transformer structure and does not utilize the convolutional backbone network.…”
Section: Transformer For Computer Visionmentioning
confidence: 99%
“…4 (b) that more than half of the targets contains about 20 pixels. Usually, small targets (e.g., aircraft, missiles) move rapidly in complex and variable clutter, making infrared images have a very low signal-to-clutter ratio (SCR) [61]. SCR [48], [62] is used to measure the target intensity and background intensity.…”
Section: Infrared Dim Small Target Datasetsmentioning
confidence: 99%