“…Empirical approaches to quantify instructional sensitivity emerged in the context of criterion‐referenced testing (e.g., Popham, ). Polikoff () recently categorized the numerous approaches according to the evidence used: (1) expert judgment (e.g., Popham, ), (2) instructional measures (e.g., D'Agostino et al., ), or (3) item statistics (e.g., Cox & Vargas, ; Linn & Harnisch, ). To date, the validity of ratings on instructional sensitivity remains unclear, and it is still unknown which instructional measures are actually relevant for investigating instructional sensitivity (Polikoff, ).…”