“…(Reul et al, 2018) make use of a voting based approach to post-correct OCR-ed early printed books in which multiple models are trained and then a decision based on confidences values for recognized characters and their alternatives is made. (Wemhoener, Yalniz, & Manmatha, 2013) also rely on a voting based decision based on the alignment of multiple OCR outputs derived from copies of the same source documents, while (Al Azawi, Ul Hasan, Liwicki, & Breuel, 2014;Al Azawi, Liwicki, & Breuel, 2015) utilize LSTM networks to determine the final output tokens and evaluate their method on a data set consisting of more modern documents (Guyon, Haralick, Hull, & Phillips, 1997). Another approach which is based on the recognition of text reuse and the alignment of these repeated passages is that of (Xu & Smith, 2017) which was tested on a collection of 19th century newspapers.…”