“…Particularly for historical texts and despite notable improvements over time (Smith and Cordell, 2018), error rates can be very high, with largely unknown biasing consequences for end users (Alex et al, 2012; Milligan, 2013; Strange et al, 2014; Cordell, 2017; Jarlbrink and Snickars, 2017; Traub et al, 2018; Cordell, 2019). Consequently, assessing and improving OCR quality has been, and still is, a key area for research and development (Alex and Burns, 2014; Ehrmann et al, 2016; Smith and Cordell, 2018; Nguyen et al, 2019; Hakala et al, 2019).…”