“…(Wang et al, 2015;Vanderwende et al, 2015;Peng et al, 2015;Pust et al, 2015;Artzi et al, 2015;Flanigan et al, 2014;Werling et al, 2015). In contrast, we follow the spirit of minimal feature extraction using pre-trained word embeddings, as in (Collobert et al, 2011) and a recurrent network architecture similar to that described in (Zhou and Xu, 2015).…”