“…This is part of a larger set of issues increasingly discussed in data‐driven NLP with regard to how linguistic data are annotated, such as the role of expert knowledge versus native‐speaker intuition (and perhaps more importantly: who counts as an expert), how to deal with variation in the annotations, and how to take factors such as these into account when setting up annotation tasks, as well as for calculating both IAA and machine learning accuracy (e.g. Babarczy et al., 2006; Bayerl & Paul, 2011; Borin, 2022; Gillick & Liu, 2010; Plank, 2022; Plank et al., 2014; Uma et al., 2021).…”