“…Previous work on parallel document mining using large distributed systems has proven effective (Uszkoreit et al., 2010; Antonova and Misyurev, 2011), but these systems are often heavily engineered and computationally intensive. Recent work on parallel data mining has focused on sentence-level embeddings (Guo et al., 2018; Artetxe and Schwenk, 2018; Yang et al., 2019). However, these sentence embedding methods have had limited success when applied to document-level mining tasks (Guo et al., 2018).…”