Statistical Signifcance Testing -or Null Hypothesis Signifcance Testing (NHST) -is common to quantitative CHI PLAY research. Drawing from recent work in HCI and psychology promoting transparent statistics and the reduction of questionable research practices, we systematically review the reporting quality of 119 CHI PLAY papers using NHST (data and analysis plan at OSF.io). We fnd that over half of these papers employ NHST without specifc statistical hypotheses or research questions, which may risk the proliferation of false positive fndings. Moreover, we observe inconsistencies in the reporting of sample sizes and statistical tests. These issues refect fundamental incompatibilities between NHST and the frequently exploratory work common to CHI PLAY. We discuss the complementary roles of exploratory and confrmatory research, and provide a template for more transparent research and reporting practices.
CCS CONCEPTS• Human-centered computing → Empirical studies in HCI .