2024
DOI: 10.3390/make6010023
|View full text |Cite
|
Sign up to set email alerts
|

VisFormers—Combining Vision and Transformers for Enhanced Complex Document Classification

Subhayu Dutta,
Subhrangshu Adhikary,
Ashutosh Dhar Dwivedi

Abstract: Complex documents have text, figures, tables, and other elements. The classification of scanned copies of different categories of complex documents like memos, newspapers, letters, and more is essential for rapid digitization. However, this task is very challenging as most scanned complex documents look similar. This is because all documents have similar colors of the page and letters, similar textures for all papers, and very few contrasting features. Several attempts have been made in the state of the art to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 48 publications
0
0
0
Order By: Relevance