“…Typically, studies of L2 production rely upon either spectrographic analysis to compare the learners' speech with a native-speaker baseline (see Simonet 2011Simonet , 2012 or, alternatively, rely upon native speaker judgments (Moyer 2007). Studies that rely upon listener evaluations are necessarily more holistic in nature and given this, the selection of raters is an important decision since raters with more experience will tend to listen for different things than those with less formal experience.…”