“…Numerous statistical procedures have been developed to evaluate item fit under an IRT model, and goodness‐of‐fit studies have been conducted and reported in the voluminous IRT literature (Bock, 1972; Douglas & Cohen, 2001; Glas & Suarez‐Falcon, 2003; Liang & Wells, 2007; McKinley & Mills, 1985; Orlando & Thissen, 2000, 2003; Sinharay, 2003, 2005; Stone, 2000; Stone & Zhang, 2003; Suarez‐Falcon & Glas, 2003; Wells, 2004; Yen, 1981). Among them, several Chi‐square‐based item‐level goodness‐of‐fit indices using significance tests such as Yen's Q 1 for dichotomous items, the traditional log‐likelihood Chi‐square, G 2 , for both dichotomous and polytomous items (McKinley & Mills), and Orlando and Thissen's S‐X 2 for dichotomous items have been used for IRT applications.…”