Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR) is a key technique in military applications such as remote-sensing image recognition. Vision Transformers (ViTs) achieve state-of-the-art performance in many computer vision applications, outperforming Convolutional Neural Networks (CNNs). However, applying ViTs to SAR ATR is challenging because (1) standard ViTs require extensive training data to generalize well due to their weak locality inductive bias, while standard SAR datasets provide only a limited amount of labeled training data, which restricts the learning capability of ViTs; and (2) ViTs have high parameter counts and are computation-intensive, which makes their deployment on resource-constrained SAR platforms difficult. In this work, we develop a lightweight ViT model that can be trained directly on small datasets without pre-training. To this end, we incorporate the Shifted Patch Tokenization (SPT) and Locality Self-Attention (LSA) modules into the ViT model and train it directly on SAR datasets to evaluate its effectiveness for SAR ATR. The proposed model, VTR (ViT for SAR ATR), is evaluated on three widely used SAR datasets: MSTAR, SynthWakeSAR, and GBSAR. Experimental results show that VTR achieves classification accuracies of 95.96%, 93.47%, and 99.46% on the MSTAR, SynthWakeSAR, and GBSAR datasets, respectively. VTR achieves accuracy comparable to state-of-the-art models on the MSTAR and GBSAR datasets with 1.1× and 36× smaller model sizes, respectively, and achieves higher accuracy on the SynthWakeSAR dataset with a 17× smaller model. Further, a novel FPGA accelerator is proposed for VTR to enable real-time SAR ATR. Compared with VTR implementations on state-of-the-art CPU and GPU platforms, our FPGA implementation reduces latency by 70× and 30×, respectively. For inference on small batch sizes, our FPGA implementation achieves 2× higher throughput than the GPU.
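For context, the sketch below illustrates the two modules the abstract names, SPT and LSA, following their standard formulation for ViTs on small datasets (Lee et al.). It is a minimal PyTorch rendering, not the authors' implementation: the class names, patch size, embedding dimension, and head count are illustrative assumptions.

```python
# Minimal sketch of SPT and LSA (illustrative; not the authors' VTR code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class ShiftedPatchTokenization(nn.Module):
    """SPT: concatenate the image with four half-patch diagonal shifts,
    then flatten non-overlapping patches and linearly project them.
    The extra shifted views widen each token's receptive field,
    injecting locality that plain patch embedding lacks."""
    def __init__(self, in_ch=1, patch=8, dim=64):  # assumed hyperparameters
        super().__init__()
        self.patch = patch
        self.proj = nn.Sequential(
            nn.LayerNorm(5 * in_ch * patch * patch),
            nn.Linear(5 * in_ch * patch * patch, dim),
        )

    def forward(self, x):                           # x: (B, C, H, W)
        s = self.patch // 2
        shifts = [(-s, -s), (-s, s), (s, -s), (s, s)]
        views = [torch.roll(x, sh, dims=(2, 3)) for sh in shifts]
        x = torch.cat([x] + views, dim=1)           # (B, 5C, H, W)
        x = F.unfold(x, self.patch, stride=self.patch)  # (B, 5C*p*p, N)
        return self.proj(x.transpose(1, 2))         # (B, N, dim)

class LocalitySelfAttention(nn.Module):
    """LSA: self-attention with a learnable temperature and the diagonal
    (token-to-self) logits masked out, sharpening the attention
    distribution toward inter-token relations."""
    def __init__(self, dim=64, heads=4):            # assumed hyperparameters
        super().__init__()
        self.heads, self.dh = heads, dim // heads
        self.qkv = nn.Linear(dim, 3 * dim)
        self.out = nn.Linear(dim, dim)
        # Learnable temperature, initialized to the usual 1/sqrt(d_h) scale.
        self.temperature = nn.Parameter(torch.tensor(self.dh ** -0.5))

    def forward(self, x):                           # x: (B, N, dim)
        B, N, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q, k, v = (t.view(B, N, self.heads, self.dh).transpose(1, 2)
                   for t in (q, k, v))
        attn = (q @ k.transpose(-2, -1)) * self.temperature
        mask = torch.eye(N, dtype=torch.bool, device=x.device)
        attn = attn.masked_fill(mask, float('-inf'))  # drop self-relations
        attn = attn.softmax(dim=-1)
        y = (attn @ v).transpose(1, 2).reshape(B, N, -1)
        return self.out(y)
```

In a lightweight ViT of this kind, SPT would replace the standard patch-embedding layer and LSA would replace the scaled dot-product attention inside each Transformer block; both changes add negligible parameters, consistent with the small model sizes reported above.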