“…Modern incarnations of autoregressive models include, among others, recurrent neural networks (RNNs) [69,70], Pixel Convolutional Neural Networks (PixelCNN) [71], and Transformers [67]. Recent work has effectively applied these models to quantum systems [50,51,65,66,72]. Here, we use an autoregressive Transformer with the same architecture as the model in [66].…”
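
The property shared by all the architectures listed above is the chain-rule factorization p(s_1, ..., s_N) = ∏_i p(s_i | s_1, ..., s_{i-1}), which allows exact sampling and exact evaluation of normalized probabilities. The following is a minimal sketch of that property only, not the Transformer of [66]: it assumes a toy logistic conditional over binary variables, and the names `conditional_p1`, `sample`, `log_prob`, `weights`, and `bias` are hypothetical, introduced purely for illustration.

```python
import itertools
import math
import random


def conditional_p1(prefix, weights, bias):
    """p(s_i = 1 | s_1..s_{i-1}) for a toy logistic autoregressive model."""
    i = len(prefix)
    # weights[i] has exactly i entries, one per earlier variable.
    logit = bias[i] + sum(w * s for w, s in zip(weights[i], prefix))
    return 1.0 / (1.0 + math.exp(-logit))


def sample(n, weights, bias, rng):
    """Draw one configuration by sampling each variable given the earlier ones."""
    config = []
    for _ in range(n):
        p1 = conditional_p1(config, weights, bias)
        config.append(1 if rng.random() < p1 else 0)
    return config


def log_prob(config, weights, bias):
    """Exact log-probability via the chain rule: sum_i log p(s_i | s_<i)."""
    total = 0.0
    for i, s in enumerate(config):
        p1 = conditional_p1(config[:i], weights, bias)
        total += math.log(p1 if s == 1 else 1.0 - p1)
    return total


rng = random.Random(0)
n = 4
# Hypothetical parameters: weights[i][j] couples variable i to earlier variable j < i.
weights = [[rng.uniform(-1, 1) for _ in range(i)] for i in range(n)]
bias = [rng.uniform(-1, 1) for _ in range(n)]

# One exact sample, drawn without any Markov chain.
config = sample(n, weights, bias, rng)

# Summing the probability over all 2^n configurations checks normalization.
total_prob = sum(
    math.exp(log_prob(list(c), weights, bias))
    for c in itertools.product([0, 1], repeat=n)
)
```

Because every conditional is normalized on its own, the joint distribution is normalized by construction, so `total_prob` equals 1 up to floating-point error. An RNN, PixelCNN, or Transformer would replace the logistic conditional with a learned network while keeping the same sampling loop and log-probability sum.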