Ocean Sensing and Monitoring XIV 2022
DOI: 10.1117/12.2619134
|View full text |Cite
|
Sign up to set email alerts
|

Improving underwater object classification: BC-ViT

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 0 publications
0
4
0
Order By: Relevance
“…Recent research on automatic target detection has transitioned to focus on machine learning based approaches using modern techniques such as advanced sensors, [6][7][8][9] deep learning, [10][11][12] convolutional neural networks, 2, 13, 14 and more recently vision transformers. 15 There are two sensors commonly used to capture information for ATD/R systems: Synthetic Aperture Radars (SAR) and Infrared (IR). 2,9 These sensors collect data, typically in the form of a frame-by-frame video in various environments allowing a multitude of targets to be seen then fed into neural networks to accomplish tasks of detection, recognition, and motion prediction.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Recent research on automatic target detection has transitioned to focus on machine learning based approaches using modern techniques such as advanced sensors, [6][7][8][9] deep learning, [10][11][12] convolutional neural networks, 2, 13, 14 and more recently vision transformers. 15 There are two sensors commonly used to capture information for ATD/R systems: Synthetic Aperture Radars (SAR) and Infrared (IR). 2,9 These sensors collect data, typically in the form of a frame-by-frame video in various environments allowing a multitude of targets to be seen then fed into neural networks to accomplish tasks of detection, recognition, and motion prediction.…”
Section: Introductionmentioning
confidence: 99%
“…This paper proposes an efficient target detection network that utilizes the overarching theme of the SegFormer most notably the MIX FFN and multi scale resolution as well as the convolution based transformer block introduced in BC-ViT. 15 The merger of these two networks allow for tiny network, Edge IR Vision Transformer (EIR-ViT), at approximately 115 thousand parameters that is tasked with the sole purpose to detect cars utilizing the FLIR dataset for training and validation.…”
Section: Introductionmentioning
confidence: 99%
“…This paper proposes a new architecture, MHATT, for 1D classification of time-series data. Insights were drawn from techniques employed by BC-ViT, 7 which advanced the field of 2D classification, for the creation of this architecture. MHATT uses highly modified vision transformers to improve performance while keeping the network highly efficient.…”
Section: Introductionmentioning
confidence: 99%
“…To handle poor lighting and weather, EIR-ViT utilizes thermal images that are not affected by these conditions. To handle low resolution, we leverage the BC-ViT [12], which was able to obtain remarkable binary classification scores with a massively decimated input image.…”
Section: Introductionmentioning
confidence: 99%