“…This rough online training is successful because normalizing flows, or invertible networks (INNs) [19,20], are especially well-suited, stable, and precise in LHC physics applications [21]. This has been shown in many instances, including event generation [22][23][24][25], detector simulations [26][27][28][29], unfolding or inverse simulations [20,30], kinematic reconstruction [31], Bayesian inference [32,33], or inference using the matrix element method [34]. On the other hand, for expensive integrands online training is clearly not optimal, because it does not make use of all previously generated data at subsequent stages of the network training.…”