2014
DOI: 10.1007/978-3-319-11758-4_41
|View full text |Cite
|
Sign up to set email alerts
|

Character-Level Alignment Using WFST and LSTM for Post-processing in Multi-script Recognition Systems - A Comparative Study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 8 publications
0
5
0
Order By: Relevance
“…Unfortunately, the version of reCAPTCHA allowing post-correction has been shutdown since March 2018. 8 Collaborative post-OCR processing approaches prove their benefits with relatively high accuracy, and are cost effective. In addition, they can be easily applied to other digitisation projects.…”
Section: Manual Approachesmentioning
confidence: 99%
See 2 more Smart Citations
“…Unfortunately, the version of reCAPTCHA allowing post-correction has been shutdown since March 2018. 8 Collaborative post-OCR processing approaches prove their benefits with relatively high accuracy, and are cost effective. In addition, they can be easily applied to other digitisation projects.…”
Section: Manual Approachesmentioning
confidence: 99%
“…Instead of aligning OCR versions of the same scan, an approach of Wemhoener et al [163] enables to create a sequence alignment of OCR outputs with the scans of different copies of the same book, or its different editions. Al Azawi et al [4,8] apply Line-to-Page alignment that aligns each line of the 1st OCR with the whole page of the second OCR using Weighted Finite-State Transducers (WFST).…”
Section: Isolated-word Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…(Reul et al, 2018) make use of a voting based approach to post-correct OCR-ed early printed books in which multiple models are trained and then a decision based on confidences values for recognized characters and their alternatives is made. (Wemhoener, Yalniz, & Manmatha, 2013) also rely on a voting based decision based on the alignment of multiple OCR outputs derived from copies of the same source documents, while (Al Azawi, Ul Hasan, Liwicki, & Breuel, 2014;Al Azawi, Liwicki, & Breuel, 2015) utilize LSTM networks to determine the final output tokens and evaluate their method on a data set consisting of more modern documents (Guyon, Haralick, Hull, & Phillips, 1997). Another approach which is based on the recognition of text reuse and the alignment of these repeated passages is that of (Xu & Smith, 2017) which was tested on a collection of 19th century newspapers.…”
Section: General Models -Cross-validation Experiments For a Large Doc...mentioning
confidence: 99%
“…The LSTM networks configuration will be explained in detail in Section IV-C. The trained language model using LSTM networks was implemented by Al Azawi et al [12]. The trained language model was applied on single OCR output.…”
Section: B Strings Encodingmentioning
confidence: 99%