2023
DOI: 10.11591/ijece.v13i1.pp894-901
|View full text |Cite
|
Sign up to set email alerts
|

Classification of heterogeneous Malayalam documents based on structural features using deep learning models

Abstract: The proposed work gives a comparative study on performance of various pretrained deep learning models for classifying Malayalam documents such as agreement documents, notebook images, and palm leaves. The documents are classified based on their visual and structural features. The dataset was manually collected from different sources. The method of research proceeds with preprocessing, feature extraction, and classification. The proposed work deals with three fine-tuned deep learning models such as visual geome… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…Jayakumari and Nair [4] proposed a deep-learning-based ResNet model for the binarization of ancient horoscopic palm leaf images, achieving a high accuracy of 95.38% on a manually collected dataset. Bipin performed a comparative study on the performance of various pre-trained deep learning models for classifying Malayalam documents, utilizing three fine-tuned deep learning models, namely VGG-16, CNN, and AlexNet, which achieved accuracies of 99.7%, 96%, and 95%, respectively [5]. However, due to the distinctive noise in non-text regions of oracle bone script images, the image features related to text regions do not exhibit significant discrimination across different historical periods compared to other historical documents, making it difficult for the CNN model to correctly extract features.…”
Section: History Document Classification2mentioning
confidence: 99%
“…Jayakumari and Nair [4] proposed a deep-learning-based ResNet model for the binarization of ancient horoscopic palm leaf images, achieving a high accuracy of 95.38% on a manually collected dataset. Bipin performed a comparative study on the performance of various pre-trained deep learning models for classifying Malayalam documents, utilizing three fine-tuned deep learning models, namely VGG-16, CNN, and AlexNet, which achieved accuracies of 99.7%, 96%, and 95%, respectively [5]. However, due to the distinctive noise in non-text regions of oracle bone script images, the image features related to text regions do not exhibit significant discrimination across different historical periods compared to other historical documents, making it difficult for the CNN model to correctly extract features.…”
Section: History Document Classification2mentioning
confidence: 99%