“…This transfer learning method inspired our work and the main difference is that they required bilingual supervision (e.g, bilingual lexicon, parallel sentences), which is not available for many low-resource language pairs. Recently, several works developed unsupervised method to mine parallel data (Hangya et al, 2018;Hangya and Fraser, 2019;Kvapilíková et al, 2020;Keung et al, 2020). These approaches mainly rely on unsupervised cross-lingual embeddings (Artetxe et al, 2018;Lample and Conneau, 2019) that be trained on monolingual corpora.…”