2011
DOI: 10.1177/0146621611428447
|View full text |Cite
|
Sign up to set email alerts
|

A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

Abstract: A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into account, thus estimating DIF magnitude appropriately when a test is composed of testlets. A fully Bayesian estimation method was adopted for parameter estimation. The recove… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
22
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(23 citation statements)
references
References 30 publications
1
22
0
Order By: Relevance
“…The effect sizes indicated that DIF type, DIF magnitude, and sample size were highly influential on rejection rates, whereas test length did not seem to have a strong impact on rejection rates. These results of MANOVA were consistent with the patterns of the rejection rates presented in Tables 2 and 3. dIsCUssIoN There have been several methods proposed for the detection of DIF in multidimensional item response data (e.g., Stout et al, 1997;Mazor et al, 1998;Fukuhara and Kamata, 2011;Suh and Cho, 2014;Lee et al, 2016). Among these methods, the logistic regression, the MIMIC-interaction model, and the IRT-LR test are the most readily available for detecting DIF in dichotomously and polytomously scored items because of their ease of use.…”
Section: Latent Mean Differencementioning
confidence: 99%
“…The effect sizes indicated that DIF type, DIF magnitude, and sample size were highly influential on rejection rates, whereas test length did not seem to have a strong impact on rejection rates. These results of MANOVA were consistent with the patterns of the rejection rates presented in Tables 2 and 3. dIsCUssIoN There have been several methods proposed for the detection of DIF in multidimensional item response data (e.g., Stout et al, 1997;Mazor et al, 1998;Fukuhara and Kamata, 2011;Suh and Cho, 2014;Lee et al, 2016). Among these methods, the logistic regression, the MIMIC-interaction model, and the IRT-LR test are the most readily available for detecting DIF in dichotomously and polytomously scored items because of their ease of use.…”
Section: Latent Mean Differencementioning
confidence: 99%
“…Moreover, on reviewing the DIF studies available in literature, it was concluded that the DIF was found to be trivial at the item level with analyses without considering the testlet effects and became clear with analyses considering the testlet effects (Fukuhara, 2009;Fukuhara & Kamata, 2011;Sedivy, 2009;Wainer et al, 1991). This does not lead to serious differences when there is large enough data sample ; but as the sample and the DIF becomes smaller, the methods considering the testlet effects displays better performance.…”
Section: Discussion and Recommendationsmentioning
confidence: 99%
“…In other words, through this method, the differential testlet functioning (DTF) is derived. Therefore, specific items which lead to DIF cannot be determined (Fukuhara & Kamata, 2011). Since constructing a testlet is demanding, time-consuming work, taking the testlet displaying DTF out of the item pool would not be a desirable case.…”
Section: Differential Item Functioning (Dif)mentioning
confidence: 99%
See 2 more Smart Citations