2023
DOI: 10.3390/app13158716
|View full text |Cite
|
Sign up to set email alerts
|

Vision Transformers for Anomaly Detection and Localisation in Leather Surface Defect Classification Based on Low-Resolution Images and a Small Dataset

Abstract: Genuine leather manufacturing is a multibillion-dollar industry that processes animal hides from varying types of animals such as sheep, alligator, goat, ostrich, crocodile, and cow. Due to the industry’s immense scale, there may be numerous unavoidable causes of damages, leading to surface defects that occur during both the manufacturing process and the bovine’s own lifespan. Owing to the heterogenous and manifold nature of leather surface characteristics, great difficulties can arise during the visual inspec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 42 publications
0
5
0
Order By: Relevance
“…Attention mechanisms and transformer models, initially proposed for natural language processing tasks, have been adapted for CV and AD. These models can focus on relevant parts of the input data, improving the detection of anomalies in complex scenes [46]. Transformers, with their self-attention layers, have shown remarkable success in modeling dependencies and identifying anomalies in highdimensional data [47].…”
Section: Attention Mechanisms and Transformersmentioning
confidence: 99%
“…Attention mechanisms and transformer models, initially proposed for natural language processing tasks, have been adapted for CV and AD. These models can focus on relevant parts of the input data, improving the detection of anomalies in complex scenes [46]. Transformers, with their self-attention layers, have shown remarkable success in modeling dependencies and identifying anomalies in highdimensional data [47].…”
Section: Attention Mechanisms and Transformersmentioning
confidence: 99%
“…The classification accuracy exceeded 90%. Antony et al [39] proposed the application of modern vision transformer architecture for the purposes of low-resolution image-based anomaly detection for leather surface defect classification.…”
Section: Related Workmentioning
confidence: 99%
“…Through unified evaluation of each model, a common benchmark is established, which strives to lay a foundation for researchers and engineers to select, design, or implement new schemes for the visual inspection and recognition of leather surface defects. The following classical deep learning models were mainly selected for performance evaluation: AlexNet [32], VGG [39], ReNet [30], GoogleNet [40], DenseNet [41], SqueezeNet [42], and ShuffleNet [43,44]. AlexNet won the 2012 ImageNet competition.…”
Section: Classic Deep Learning Modelsmentioning
confidence: 99%
“…The recent contributions underscore the array of approaches and techniques deployed in the realm of unsupervised visual anomaly detection for industrial applications, laying the foundation for more robust and efficient anomaly detection solutions across diverse industrial settings [24][25][26][27][28][29][30][31][32][33][34][35][36][37]. It is essential to emphasize that, distinct from unsupervised learning in other vision tasks, unsupervised anomaly detection tasks leverage anomaly-free images for training, leading to a paradigm where models inherently operate under the out-of-distribution concept.…”
Section: Unsupervised Anomaly Detectionmentioning
confidence: 99%