“…This was studied by [15,20,21,3] in the context of MDE, and by [47,28,45,73,13] for the case where G θ is a neural network in particular. It was also used by [55,64,49,42,12] in the context of ABC. The two other discrepancies we will consider are the Wasserstein distance, as well as its relaxation called the Sinkhorn divergence.…”