2020
DOI: 10.3844/jcssp.2020.784.801
|View full text |Cite
|
Sign up to set email alerts
|

An Adaptive Thresholding Algorithm-Based Optical Character Recognition System for Information Extraction in Complex Images

Abstract: Extracting texts from images with complex backgrounds is a major challenge today. Many existing Optical Character Recognition (OCR) systems could not handle this problem. As reported in the literature, some existing methods that can handle the problem still encounter major difficulties with extracting texts from images with sharp varying contours, touching word and skewed words from scanned documents and images with such complex backgrounds. There is, therefore, a need for new methods that could easily and eff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 18 publications
0
7
0
Order By: Relevance
“…Haraj and Raissouni produced an average of 95.77% charcater accuracy using tesseract and opencv library over 4 sample images in 2015 [17]. Those research [14,15,16,17] only used relatively small samples (less than 50 documents), while our study used more documents (8,562 documents in 6 Categories and two document structures). Previous research [14,15,16,17], which also employed the Tesseract library, only used string matching to measure the OCR.…”
Section: Related Workmentioning
confidence: 83%
See 4 more Smart Citations
“…Haraj and Raissouni produced an average of 95.77% charcater accuracy using tesseract and opencv library over 4 sample images in 2015 [17]. Those research [14,15,16,17] only used relatively small samples (less than 50 documents), while our study used more documents (8,562 documents in 6 Categories and two document structures). Previous research [14,15,16,17], which also employed the Tesseract library, only used string matching to measure the OCR.…”
Section: Related Workmentioning
confidence: 83%
“…Similar research employed the Tesseract library [16] with only 11 images as input yielding 69.7% precision. On the other hand, our study produced 83.07% precision with 8,562 documents as the same library input.…”
Section: B Discussionmentioning
confidence: 99%
See 3 more Smart Citations