Third International Conference on Computer Vision and Information Technology (CVIT 2022) 2023
DOI: 10.1117/12.2669907
|View full text |Cite
|
Sign up to set email alerts
|

Convolutions vs. Sequences: Understanding performances of neural-based methods for automatic Baybayin script recognition

Abstract: Common approaches to vision-based tasks such as character and object recognition use Convolutional Neural Networks (CNNs) due to their practicality in processing images and theoretical grounding. In this work, we take a different perspective in the task of Baybayin script recognition by exploring Vision Transformers, a new paradigm for processing images inspired by the Transformer model. We compare performances of CNNs and ViT and analyzed model confidence on a set of test images using Local Interpretable Mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 8 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?