“…It does, however, require a definition of register sufficiently precise that human annotators can label texts accordingly with high inter-rater reliability, which is not always easy to achieve. Register classification can comprise a stand-alone task (Stamatatos, Fakotakis, & Kokkinakis 2000, Biber & Conrad 2001, Argamon, Koppel, Fine, & Shimoni 2003, Finn & Kushmerick 2006, Santini 2006, Herring & Paolillo 2006, Abbasi & Chen 2007, Dong, Watters, Duffy, & Shepherd 2008, Sharoff, Wu, & Markert 2010 or may be used to derive insights into larger questions related to linguistic variation (e.g., Atkinson 1992, Argamon, Dodick, & Chase 2008, Eisenstein, Smith, & Xing 2011, Teich, Degaetano-Ortlieb, Kermes, & Lapshinova-Koltunski 2013, Clarke & Grieve 2017. Register labels, either manually or automatically assigned, can also be used to control for register in research on other text analysis methods (e.g., Carroll et al 1999, Giesbrecht & Evert 2009, Sharoff et al 2010; differences in register between training and testing data often affect outcomes for NLP tasks such as part-ofspeech tagging, parsing, or information extraction.…”