2017
DOI: 10.22214/ijraset.2017.9219

A Study to Recognize Printed Gujarati Characters Using Tesseract OCR

Abstract: Optical Character Recognition (OCR) is a widely known technique for recognizing printed text using a computer with the help of various peripheral devices. Research on OCR for the scripts of many languages is in progress, while many languages are still far behind. Gujarati is one of the least studied scripts in OCR research compared to other scripts. Tesseract, a well-known open-source OCR engine that is already used to recognize numerous scripts, can be used to recognize printed Gujara…

Cited by 7 publications (4 citation statements)
References 6 publications
“…They utilized the existing trained data for the Gujarati script within Tesseract and aimed to shed light on its effectiveness and potential in OCR applications for the Gujarati language. (1) Jaspreet Kaur et al. conducted a study focused on recognizing typewriter-typed Hindi documents using the Tesseract OCR engine.…”
Section: Related Work Tesseract OCR (mentioning)
confidence: 99%
“…Shots of the scene are related by frame based key similarities. Some research has been done on few Indian regional language videos for information retrieval [14]. No dataset is available to work on any of the regional language videos.…”
Section: ©IJRASET (UGC Approved Journal): All Rights Are Reserved (mentioning)
confidence: 99%
“…Video data normally contains audio and visual features such as color, texture, edge information, motion vectors, loudness, pitch, etc. [2,14]. In case of textual information present in video clip, the text data which are continuously being displayed for certain time gives some important information about what is currently being viewed [3,4].…”
Section: Introduction (mentioning)
confidence: 99%
“…Gujarati is written in the Devanagari script as well and is currently supported by The Unicode Standard [6]. Gujarati is also thriving in terms of research and development in recent years [7], [8].…”
Section: Introduction (mentioning)
confidence: 99%