“…We drew on the language of the NGSS, McNeill and Krajcik (2012), Moje et al. (2004), and our own previous work to design the rubric and to address reliability and validity. One of the major challenges to rubric reliability is interrater consistency (Brown, Bull, & Pendlebury, 1997), with alpha scores for interrater agreement greater than .70 considered sufficient (Jonsson & Svingby, 2017). In this study, three authors scored the display boards, and their scores met acceptable levels of interrater agreement (Baker, Abedi, Linn, & Niemi, 1996), with an interrater reliability of 0.88.…”
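
The passage does not say which alpha statistic was used or how it was computed. As an illustration only, the sketch below assumes Cronbach's alpha with each rater treated as an "item" and each display board as a case. The function name `cronbach_alpha`, the 0.70 cutoff variable, and the scores are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(ratings):
    """Cronbach's alpha across raters (assumed statistic, not confirmed by the source).

    ratings: one list of scores per rater, all of the same length
             (one score per display board).
    """
    k = len(ratings)
    if k < 2:
        raise ValueError("need at least two raters")
    n = len(ratings[0])
    if n < 2 or any(len(r) != n for r in ratings):
        raise ValueError("each rater must score the same set of two or more boards")

    # Sum of each rater's sample variance across boards.
    rater_variance_sum = sum(variance(r) for r in ratings)
    # Sample variance of the per-board totals (scores summed over raters).
    totals = [sum(board_scores) for board_scores in zip(*ratings)]
    total_variance = variance(totals)
    if total_variance == 0:
        raise ValueError("total scores do not vary; alpha is undefined")

    return (k / (k - 1)) * (1 - rater_variance_sum / total_variance)


# Hypothetical rubric scores from three raters on five boards (illustrative only).
example_ratings = [
    [3, 4, 2, 5, 1],
    [3, 5, 2, 4, 1],
    [4, 4, 2, 5, 2],
]
SUFFICIENT_ALPHA = 0.70  # threshold cited in the passage
alpha = cronbach_alpha(example_ratings)
meets_threshold = alpha > SUFFICIENT_ALPHA
```

Under this assumption, identical scores from all raters give an alpha of 1, and values fall as raters diverge. Other agreement statistics (for example, Krippendorff's alpha or an intraclass correlation) would give different values for the same scores.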