“…Ficler and Goldberg (2017) focus on modulating formality depending on context, while others focus on the personalisation of language models, such as reflecting author personality in machine translation Mirkin and Meunier, 2015;Rabinovich et al, 2017); providing financial recommendations via chat bots (Den Hengst et al, 2019); or enabling customised online shopping (Mo et al, 2016). Most studies target human preferences assumed to be commonly-held and stable, such as word order (Futrell and Levy, 2019), sense making (De Deyne et al, 2016;Seminck and Amsili, 2017) and vocabulary matching (Campano et al, 2014;Dubuisson Duplessis et al, 2017). In contrast, Nguyen et al (2017) and Kreutzer et al (2017) acknowledge the noisiness of human feedback but attempt to extract a single, unified preference.…”