“…There has been extensive interest in non-autoregressive/parallel generation approaches, aiming at producing a sequence in parallel sub-linear time w.r.t. sequence length [13,52,26,65,53,14,11,12,48,15,28,16,49,55,30,41,64,62]. Existing approaches can be broadly classified as latent variable based [13,26,65,28,41], refinement-based [25,48,14,15,11,30,12,62] or a combination of both [41].…”