“…Automated computational decipherment of such texts is challenging (Pettersson and Megyesi, 2019;Megyesi et al, 2020). Prior work has mainly focused on using clever heuristics and/or search algorithms to explore the space of cipher keys and score multiple candidate plaintexts under character language models (LMs) (Knight et al, 2006;Corlett and Penn, 2010;Hauer et al, 2014;Berg-Kirkpatrick and Klein, 2013;Nuhn et al, 2013Nuhn et al, , 2014 In contrast Aldarrab and May (2021) train a sequence-tosequence model to solve simple (one-to-one) substitution ciphers. This approach, however, cannot solve complex homophonic ciphers as it relies on frequency information which such ciphers obscure.…”