2020
DOI: 10.5829/ije.2020.33.07a.05
|View full text |Cite
|
Sign up to set email alerts
|

Learning Document Image Features With SqueezeNet Convolutional Neural Network

Abstract: The classification of various document image classes is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for training, and their very large number of weights. Previous successful attempts at learning document image feat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 13 publications
0
6
0
Order By: Relevance
“…SqueezeNet is a CNN architecture that can achieve the accuracy of AlexNet, which incidentally won the ImageNet classification task in 2012 by using fewer parameters and fast training time [16]. The SqueezeNet architecture replaces several 3x3 convolution layers with 1x1 and filters are used less to shrink the dimensions of the activation map or called squeeze [17]. Then, proceed with the convolution layer process with more filters to enlarge the activation map again, or what is called the expanding process.…”
Section: Convolutional Neural Network (Cnn)mentioning
confidence: 99%
“…SqueezeNet is a CNN architecture that can achieve the accuracy of AlexNet, which incidentally won the ImageNet classification task in 2012 by using fewer parameters and fast training time [16]. The SqueezeNet architecture replaces several 3x3 convolution layers with 1x1 and filters are used less to shrink the dimensions of the activation map or called squeeze [17]. Then, proceed with the convolution layer process with more filters to enlarge the activation map again, or what is called the expanding process.…”
Section: Convolutional Neural Network (Cnn)mentioning
confidence: 99%
“…A novel model was investigated by Gayathri and Kannan [12] on a dataset containing over 3,000 images to classify Ayurvedic documents which gave promising results. Jimenez et al [13] [14] to classify documents based on visual features with an accuracy of 77% was obtained. Kanchi et al [15] discussed a deep multi model-based approach to classify documents on datasets containing 16 and 10 classes respectively.…”
Section: Introductionmentioning
confidence: 99%
“…Their application has revolutionized the way we tackle various challenges and tasks. With their ability to analyze large amounts of data and extract meaningful features [33,34], CNNs have been extensively applied in ocean engineering, including ocean data analysis, ocean environmental monitoring, marine robotics, and autonomous systems [35][36][37][38][39][40][41][42]. Him et al [35] show that a statistical forecast model employing a CNN approach produces skilled ENSO forecasts for lead times of up to one and a half years.…”
Section: Introductionmentioning
confidence: 99%