2019
DOI: 10.1016/j.heliyon.2019.e02613
|View full text |Cite
|
Sign up to set email alerts
|

Effective and fast binarization method for combined degradation on ancient documents

Abstract: Document image binarization is a challenging task because of combined degradation in a document. In this study, a new binarization method is proposed for binarizing an ancient document with combined degradation. The proposed method comprises the following four stages: histogram analysis, contrast enhancement, local adaptive thresholding, and artifact removal. In histogram analysis, a new approach is applied to establish a uniform background. Next, the image contrast is enhanced using a new contrast enhancement… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 19 publications
(5 citation statements)
references
References 24 publications
0
5
0
Order By: Relevance
“…Preprocessing techniques for image enhancement in OCR for image-to-LaTeX conversion are crucial for improving the accuracy of the recognition process [22,23]. These techniques address challenges related to image quality, noise, contrast, skew, and multimodal features.…”
Section: Preprocessing Techniques For Image Enhancementmentioning
confidence: 99%
See 2 more Smart Citations
“…Preprocessing techniques for image enhancement in OCR for image-to-LaTeX conversion are crucial for improving the accuracy of the recognition process [22,23]. These techniques address challenges related to image quality, noise, contrast, skew, and multimodal features.…”
Section: Preprocessing Techniques For Image Enhancementmentioning
confidence: 99%
“…Various error detection methodologies include statistical analysis, linguistic analysis, and pattern matching. A study compared the OCR output with statistical models to identify discrepancies [22]. Statistical techniques can identify potential errors based on their deviation from the expected patterns by analyzing the frequency and distribution of symbols.…”
Section: Post-processing Techniques For Error Correction In Ocr For I...mentioning
confidence: 99%
See 1 more Smart Citation
“…The adaptive ability of the algorithm, therefore, is not good enough and there is still much room for improvement. Saddami et al proposed a new binarization method using a novel local adaptive threshold to extract information from nonuniform illumination images [19]. The new local threshold is an adaptive mean value.…”
Section: Related Workmentioning
confidence: 99%
“…Here, the digitalization may generate additional, frequently heavy noise on the document. Inherently, most degraded ancient documents contain multiple degradation types within a single document [ 5 ] and additional digitalization noise, which can lead to the failure of simple image analysis.…”
Section: Introductionmentioning
confidence: 99%