2000
DOI: 10.1177/01466216000241003
|View full text |Cite
|
Sign up to set email alerts
|

Likelihood-Based Item-Fit Indices for Dichotomous Item Response Theory Models

Abstract: New goodness-of-fit indices are introduced for dichotomous item response theory (IRT) models. These indices are based on the likelihoods of number-correct scores derived from the IRT model, and they provide a direct comparison of the modeled and observed frequencies for correct and incorrect responses for each number-correct score. The behavior of Pearson's χ 2 (S-X 2 ) and the likelihood ratio G 2 (S-G 2 ) was assessed in a simulation study and compared with two fit indices similar to those currently in use (… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

18
600
2
14

Year Published

2003
2003
2018
2018

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 559 publications
(634 citation statements)
references
References 21 publications
18
600
2
14
Order By: Relevance
“…For example, BILOG (Mislevy & Bock, 1998) provides item chi-square fit statistics, but these are problematic on a number of grounds and cannot be used to compare the fit of different models on an item-by-item basis. See Orlando and Thissen (2000) for further comment. where we take the sum of squared differences (x ij ) and predicted (P ij ) item responses for every cell in the Persons (J) × Items (I) matrix.…”
Section: Root-mean-square Residualsmentioning
confidence: 99%
“…For example, BILOG (Mislevy & Bock, 1998) provides item chi-square fit statistics, but these are problematic on a number of grounds and cannot be used to compare the fit of different models on an item-by-item basis. See Orlando and Thissen (2000) for further comment. where we take the sum of squared differences (x ij ) and predicted (P ij ) item responses for every cell in the Persons (J) × Items (I) matrix.…”
Section: Root-mean-square Residualsmentioning
confidence: 99%
“…Differential item functioning is detected by comparing the item responses of people who have the same underlying true value of the construct. Because this true value is not known, it is estimated using different methods, for example, the summed score across all items, or an estimate based on item response theory [31,32,34,[44][45][46]. The analyses can be carried out in two stages: initially with all items and then without items exhibiting bias [10,16].…”
Section: Testing For Differential Item Functioningmentioning
confidence: 99%
“…The χ 2 G statistic was included in the simulations, as in Dodeen (2004), as well as the S − χ 2 and S − G 2 statistics suggested by Orlando and Thissen (2000). For computing S − χ 2 and S − G 2 , the examinees were divided into G groups based on their raw scores.…”
Section: The Item Fit Statistics Consideredmentioning
confidence: 99%
“…Item fit is a major area of interest in model checking. Though researchers have suggested several different item fit statistics (e.g., Bock, 1972;Orlando & Thissen, 2000;Sinharay, 2006;Stone, 2000;Stone & Zhang, 2003;Glas & Suarez-Falcon, 2003;Yen, 1981), there is a lack of sufficient knowledge regarding factors that usually cause item misfit. For example, more appropriate assessments have resulted because the substantial existing knowledge of factors affecting differential item functioning, or DIF, (see, for example, Schmitt, Holland, & Dorans, 1993, and the references therein) often help test developers to control the number of items with DIF.…”
mentioning
confidence: 99%
See 1 more Smart Citation