“…Recently, much effort has also been directed towards learning representations for larger pieces of text, with methods ranging from clever compositions of word embeddings (Mitchell and Lapata, 2008; De Boom et al., 2016; Arora et al., 2017; Wieting et al., 2016; Wieting and Gimpel, 2018; Zhelezniak et al., 2019) to sophisticated neural architectures (Le and Mikolov, 2014; Kiros et al., 2015; Conneau et al., 2017; Gan et al., 2017; Tang et al., 2017; Zhelezniak et al., 2018; Subramanian et al., 2018; Pagliardini et al., 2018; Cer et al., 2018).…”