“…Inspired by neural machine translation, early works apply the sequence-to-sequence model to this task and achieve promising results (Ritter et al, 2011;Shang et al, 2015;Vinyals and Le, 2015). Since then, various architectures have been proposed to address the key challenges in open-domain dialogue systems, including suppressing the generic responses (Li et al, 2015;Zhao et al, 2017;Xing et al, 2017a), context modeling (Serban et al, 2016Xing et al, 2017b;Zhang et al, 2019a), controlling the attributes of responses (Xu et al, 2019;Zhou et al, 2017;Zhang et al, 2018a;Wang et al, 2018;See et al, 2019) and incorporating different types knowledge into generation (Li et al, 2016;Zhang et al, 2018b;Zhou et al, 2017;Zhao et al, 2020). In this work, we study the problem of stylized response generation, which aims to incorporate the style information from non-parallel data into the generation process.…”