2017
DOI: 10.1117/12.2266984
|View full text |Cite
|
Sign up to set email alerts
|

Classification of photographed document images based on deep-learning features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 6 publications
0
4
0
Order By: Relevance
“…As shown in Table 1, several popular CNN architectures, such as AlexNet [32], VGGNet [33], and ResNet [34], have been widely used in research. We experimented with various architectures to investigate the applicability and usability of our dataset and report all results to provide an in depth review of the applicability and limitations of the proposed dataset.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…As shown in Table 1, several popular CNN architectures, such as AlexNet [32], VGGNet [33], and ResNet [34], have been widely used in research. We experimented with various architectures to investigate the applicability and usability of our dataset and report all results to provide an in depth review of the applicability and limitations of the proposed dataset.…”
Section: Related Workmentioning
confidence: 99%
“…We used five CNN architectures: ResNet-18 [38], AlexNet [32], VGG-16 [33], SqueezeNet [39], and DenseNet 161 [40]. All models were pretrained on the 1000-class ImageNet dataset [41].…”
Section: Experiments a System Setups And Model Parametersmentioning
confidence: 99%
“…Accordingly, this feature was called deep convolutional activation feature (DeCAF). In [95] Recently, the deep learning models obtained much attention are recurrent neural networks (RNNs) [79,97], long short-term memory (LSTM) [98,99], attention based models [100,101] and generative adversarial nets [102]. The applications are generally focused on image classification, object detection, speech recognition, handwriting recognition, image caption generation and machine translation [103,104,105].…”
Section: Deep Learning Modelsmentioning
confidence: 99%
“…They tend to use multiple learning algorithms for better predictive performance compared with any other constituent learning algorithms alone [2][3][4][5]. In particular, even the deep learning models that have already successfully applied in many fields [6][7][8][9], are also popular to use the idea of ensemble learning for improving their performance [10][11][12][13]. For instance, some work adopts the ensemble of deep networks to perform classification or detection tasks [14,15].…”
Section: Introductionmentioning
confidence: 99%