“…Snorkel (Ratner et al., 2017), Hard/easy sets (Gururangan et al., 2018), Errudite Compositional-sensitivity Transformations NLPAug (Ma, 2019), Counterfactuals (Kaushik et al., 2019), Stress test (Naik et al., 2018), Bias factors (Sanchez et al., 2018), (Cooper et al., 1994), RTE (Dagan et al., 2005), SICK (Marelli et al., 2014), SNLI, MNLI (Williams et al., 2018), Checklist (Ribeiro et al., 2020), HANS (McCoy et al., 2019b), Quantified NLI (Geiger et al., 2018), MPE (Lai et al., 2017), EQUATE (Ravichander et al., 2019), DNC, ImpPres (Jeretic et al., 2020), Systematicity (Yanaka et al., 2020), ConjNLI (Saha et al., 2020), SherLIiC (Schmitt and Schütze, 2019). Example: author new movie reviews in the style of a newspaper columnist.…”