“…To foster research on dialog policy learning for virtual digital assistants, several task-oriented dialog corpora have been introduced in recent years, such as SimDial (Zhao and Eskenazi, 2018), Multi-WoZ (Budzianowski et al, 2018), Taskmaster (Byrne et al, 2019), and Schema Guided Dialog (Rastogi et al, 2019), to name a few. Deep learning approaches, including mixture models (Pei et al, 2019) hierarchical encoder/decoder Chen et al, 2019), reinforcement learning (Zhao et al, 2019), and pre-trained language models (Wu et al, 2019;Peng et al, 2020;Hosseini-Asl et al, 2020), have significantly advanced dialog policy research in the past few years , setting new state-of-the-art performance limits.…”