Vision Transformers (ViTs) have achieved significant advances across a wide range of computer vision tasks. However, the quadratic cost of token interactions in self-attention makes ViT models inefficient, which greatly limits their deployment in real-world scenarios. In recent years, researchers have observed that not all tokens contribute equally to the final prediction, motivating token compression methods that fall mainly into two categories: token pruning and token merging. We argue, however, that neither pruning alone (discarding non-critical tokens) nor merging alone (combining similar tokens) is an optimal compression strategy. To overcome this challenge, this work proposes a token compression framework, joint merging and pruning (JMP), which adaptively selects the better compression strategy for each sample based on the similarity between critical and non-critical tokens. JMP effectively reduces computational complexity while maintaining model performance and introduces no additional trainable parameters, achieving a good balance between efficiency and accuracy. On DeiT-S, for example, JMP reduces floating-point operations by 35% and increases throughput by more than 45% while decreasing ImageNet top-1 accuracy by only 0.2%.
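
The abstract does not spell out implementation details, so the following PyTorch sketch is only a rough illustration of the adaptive idea, not the paper's actual method: the function name `jmp_compress` and the parameters `keep_ratio` and `sim_threshold` are our own placeholders. It splits tokens into critical and non-critical sets using [CLS] attention as an importance score, then either merges each non-critical token into its most similar critical token or prunes the non-critical tokens, depending on the per-sample similarity.

```python
import torch
import torch.nn.functional as F

def jmp_compress(tokens, cls_attn, keep_ratio=0.7, sim_threshold=0.5):
    # Illustrative sketch (not the paper's code).
    # tokens:   (B, N, D) patch tokens, excluding the [CLS] token
    # cls_attn: (B, N) attention from [CLS] to each patch, used as importance
    B, N, D = tokens.shape
    k = max(1, int(N * keep_ratio))

    # Split tokens into critical / non-critical sets by importance rank.
    order = cls_attn.argsort(dim=1, descending=True)
    crit_idx, non_idx = order[:, :k], order[:, k:]
    crit = tokens.gather(1, crit_idx.unsqueeze(-1).expand(-1, -1, D))
    non = tokens.gather(1, non_idx.unsqueeze(-1).expand(-1, -1, D))

    # Cosine similarity of each non-critical token to its closest critical token.
    sim = F.normalize(non, dim=-1) @ F.normalize(crit, dim=-1).transpose(1, 2)  # (B, N-k, k)
    best_sim, best_match = sim.max(dim=-1)                                       # (B, N-k)

    out = []
    for b in range(B):
        if best_sim[b].mean() > sim_threshold:
            # High similarity: merge each non-critical token into its nearest critical token.
            merged = crit[b].clone()
            merged.index_add_(0, best_match[b], non[b])
            counts = torch.ones(k, device=tokens.device)
            counts.index_add_(0, best_match[b], torch.ones(N - k, device=tokens.device))
            out.append(merged / counts.unsqueeze(-1))
        else:
            # Low similarity: prune, i.e. simply drop the non-critical tokens.
            out.append(crit[b])
    return torch.stack(out)  # (B, k, D)
```

Because the decision is made per sample from the similarity statistics, no extra trainable parameters are involved, which is consistent with the parameter-free property claimed in the abstract.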