“…Given that the SPR task also included data for each sentence segment, 168 reliability estimates were available. For the 84 estimates pertaining to L1 English participants (see also Mifka-Profozic et al, 2020), internal consistency was high overall (Mdn = .94, range: .43-.99, IQR = 0.06), as well as for congruent and incongruent items (42 estimates each, respectively, Mdn = .94, .93, IQR = 0.04, 0.08), different types of modality (28 estimates per type, ability: Mdn = .95, IQR = 0.06; epistemic: Mdn = .94, IQR = 0.04; permission: Mdn = .92, SD = 0.16), stimuli versions 1 and 2 (42 estimates each, respectively, Mdn = .97, IQR = 0.03 and Mdn = .91, IQR = 0.11), and across all six segments (Mdn = .92 to .97, IQR = 0.04 to 0.10). For the L2 English participants, internal consistency was similarly high overall for the 84 estimates (Mdn = .96, range: .86-.99, IQR = 0.05), for congruent and incongruent items (42 estimates each, respectively, Mdn = .96, .96, IQR = 0.03, 0.05), different modality types (28 estimates per type, ability: Mdn = .96, IQR = 0.03; epistemic: Mdn = .96, IQR = 0.03; permission: Mdn = .97, SD = 0.07), for stimuli versions 1 and 2 (42 estimates each, respectively, Mdn = .97, IQR = 0.03 and Mdn = .95, IQR = 0.05), and across all six segments (Mdn = .92 to .98, IQR = 0.01 to 0.05).…”