“…The item difficulty of the verbal performance items ranged from very difficult (pi = 0.06) to moderate (pi = 0.61). In line with previous research suggesting that performance items need to be difficult to trigger stereotype threat effects (Keller, 2007), and research defining pi < 0.30 as difficult (e.g., Shete et al., 2015), we analyzed only the performance results on the more difficult verbal performance task. In our verbal performance task ("analogies"), which was adapted from the I-S-T 2000 R test (Amthauer et al., 2001), participants had to identify the relationship between two given words and then apply that rule to choose, out of five possible alternatives, the word that shows a similar relationship (i.e., analogy) to another given word (Beauducel, Liepmann, Horn, & Brocke, 2010).…”
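The item difficulty index referred to above is conventionally the proportion of participants who answer an item correctly, so lower values indicate harder items. The following is a minimal sketch of that computation and of a difficulty cutoff, not the authors' analysis code. The response matrix is invented for illustration, the function names `item_difficulty` and `difficult_items` are hypothetical, and the default cutoff of 0.30 assumes the difficulty index is expressed as a proportion rather than a percentage.

```python
from typing import List


def item_difficulty(responses: List[List[int]]) -> List[float]:
    """Return p_i for each item: the proportion of participants who solved it.

    `responses` holds one row per participant and one column per item,
    with 1 for a correct answer and 0 for an incorrect one.
    """
    n_participants = len(responses)
    n_items = len(responses[0])
    return [
        sum(row[i] for row in responses) / n_participants
        for i in range(n_items)
    ]


def difficult_items(p_values: List[float], threshold: float = 0.30) -> List[int]:
    """Return the indices of items whose difficulty index is below the cutoff."""
    return [i for i, p in enumerate(p_values) if p < threshold]


# Hypothetical data: 5 participants, 3 items (not taken from the study).
responses = [
    [0, 1, 0],
    [0, 1, 1],
    [0, 0, 0],
    [1, 1, 0],
    [0, 0, 0],
]

p_values = item_difficulty(responses)
hard = difficult_items(p_values)
print(p_values, hard)
```

With this invented matrix, the first and third items are each solved by one of five participants and would count as difficult under the assumed cutoff, while the second item would not.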