2010 20th International Conference on Pattern Recognition 2010
DOI: 10.1109/icpr.2010.498
|View full text |Cite
|
Sign up to set email alerts
|

OCR Post-processing Using Weighted Finite-State Transducers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0
2

Year Published

2013
2013
2020
2020

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 34 publications
(26 citation statements)
references
References 11 publications
0
24
0
2
Order By: Relevance
“…To propagate the outputs in the beam efficiently, a number of efficient structures have been devised to compactly encode certain families of distributions [21,17]. Efficient encodings for top-k lists can improve the scalability of our approach as well.…”
Section: Related Workmentioning
confidence: 99%
“…To propagate the outputs in the beam efficiently, a number of efficient structures have been devised to compactly encode certain families of distributions [21,17]. Efficient encodings for top-k lists can improve the scalability of our approach as well.…”
Section: Related Workmentioning
confidence: 99%
“…Although the requirements are very different, most basic techniques used in that field can be applied to OCR tasks with little modification. Thus, several works use language modeling techniques for error-correcting applied to OCR and text recognition tasks, either on constrained or unconstrained environments Hull & Srihari (1982); Tong & Evans (1996); Perez-Cortes et al (2000); Kolak & Resnik (2005); Llobet et al (2010). Confidence measures reflecting the likelihood that a given OCR hypothesis belongs to the model are provided by many of them.…”
Section: Large-scale Ocr Systems and Post-processingmentioning
confidence: 99%
“…A technique based on Weighted Finite-State Transducers (WFSTs) combining language, hypothesis and error models has been used to post-process the OCR hypotheses Llobet et al (2010). It is based on a finite-state transducer built from a formal grammar that encodes the strings in the lexicon or language sample.…”
Section: Post-processing Algorithm and Language Models Usedmentioning
confidence: 99%
See 1 more Smart Citation
“…This decreases the time needed for manual post-correction since correct words do not have to be considered as candidates for correction by the human corrector. Llobet et al (2010) combine information from the OCR system output, the error distribution and the language as weighted finite-state transducers. Reffle and Ringlstetter (2013) use global as well as local error information to be able to fine-tune post-correction systems to historical documents.…”
Section: Related Workmentioning
confidence: 99%