2020
DOI: 10.1177/1094428120959822
Detecting DIF in Multidimensional Forced Choice Measures Using the Thurstonian Item Response Theory Model

Abstract: Although modern item response theory (IRT) methods of test construction and scoring have overcome ipsativity problems historically associated with multidimensional forced choice (MFC) formats, there has been little research on MFC differential item functioning (DIF) detection, where item refers to a block, or group, of statements presented for an examinee’s consideration. This research investigated DIF detection with three-alternative MFC items based on the Thurstonian IRT (TIRT) model, using omnibus Wald test…

Cited by 17 publications (7 citation statements)
References 87 publications
“…Finally, all items can be tested for DIF because they are on a common metric. This approach appears flexible, as it was recently extended to multidimensional forced-choice measures (Lee et al., 2021) and has been recommended in IRT whenever equivalent anchor items are unknown (Tay et al., 2015). Although these approaches have primarily been used in the IRT literature, they can also be generalized to the CFA framework.…”
Section: Alternative Methods for Testing ME
Citation type: mentioning (confidence: 99%)
“…For example, in the Programme for International Student Assessment (PISA) 2018 reading assessment, the item parameter differences for the discrimination and difficulty parameters across the participating countries ranged from .01 to .88 and from .05 to 1.17, respectively (Joo et al., 2021; OECD, 2019). In addition, we considered the 10% and 20% DIF conditions because these proportions are commonly found in practice (e.g., Joo et al., 2021; Lee et al., 2021; Stark et al., 2004, 2006) and in research settings (e.g., Kim & Cohen, 1992, 1998; Oshima et al., 1997, 2006; Rutkowski & Svetina, 2014).…”
Section: Methods
Citation type: mentioning (confidence: 99%)
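The DIF proportions and parameter-difference ranges quoted in this statement translate directly into a simulation setup. The following is a minimal Python sketch of such a design; the reference-group parameter ranges are hypothetical placeholders, not the generating values used in the cited studies.

import numpy as np

# Sketch of the quoted DIF conditions: 10% (or 20%) of items receive
# DIF, with shifts drawn from the PISA 2018 ranges (.01-.88 for
# discrimination, .05-1.17 for difficulty).
rng = np.random.default_rng(42)
n_items = 30
dif_prop = 0.10  # the cited studies also examine 0.20

a_ref = rng.uniform(0.8, 2.0, n_items)   # discriminations (assumed range)
b_ref = rng.uniform(-2.0, 2.0, n_items)  # difficulties (assumed range)

n_dif = round(dif_prop * n_items)
dif_items = rng.choice(n_items, size=n_dif, replace=False)

a_foc, b_foc = a_ref.copy(), b_ref.copy()
a_foc[dif_items] += rng.uniform(0.01, 0.88, n_dif)  # discrimination DIF
b_foc[dif_items] += rng.uniform(0.05, 1.17, n_dif)  # difficulty DIF

print(f"DIF items: {sorted(dif_items.tolist())}")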
“…DIF is parameter inconsistency at the item level. P. Lee et al. [70] proposed an omnibus Wald test for the discrimination and intercept parameters of the TIRT model and showed through simulation that detection was more effective under the free-baseline method: the detection rate approached 1 and the Type I error rate stayed near .05 as sample size and DIF magnitude increased. Qiu & Wang [71] proposed three DIF test methods for the RIM: EMD (equal-mean-difficulty), AOS (all-other-statement), and CS (constant-statement).…”
Section: Applied Research
Citation type: mentioning (confidence: 99%)
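The omnibus Wald test described in this statement jointly compares a block's parameter estimates between reference and focal groups. Below is a minimal Python sketch of that statistic, assuming independently estimated groups on a linked metric; the parameter estimates and covariance values are toy numbers, not results from the cited study.

import numpy as np
from scipy import stats

def omnibus_wald_dif(est_ref, cov_ref, est_foc, cov_foc):
    """Jointly test whether a block's parameter estimates (e.g., TIRT
    discriminations and intercepts) differ across groups. Returns the
    Wald statistic, degrees of freedom, and p-value."""
    diff = np.asarray(est_ref) - np.asarray(est_foc)
    # With independent group samples, the covariance of the difference
    # is the sum of the two groups' estimate covariance matrices.
    pooled = np.asarray(cov_ref) + np.asarray(cov_foc)
    w = float(diff @ np.linalg.solve(pooled, diff))
    df = diff.size
    return w, df, stats.chi2.sf(w, df)

# Toy example: three discriminations and two intercepts for one block.
est_ref = np.array([1.1, -0.8, 0.9, 0.2, -0.3])
est_foc = np.array([1.3, -0.7, 0.6, 0.5, -0.2])
cov = np.diag(np.full(5, 0.02))  # assumed estimation error variances

w, df, p = omnibus_wald_dif(est_ref, cov, est_foc, cov)
print(f"Wald = {w:.2f}, df = {df}, p = {p:.4f}")  # flag DIF if p < .05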
“…At present, research on parameter invariance exists only for the TIRT [69, 70] and the RIM [71]. Future studies should broaden the repertoire of differential item functioning (DIF) test methods for forced-choice models and improve their sensitivity in detecting DIF from multiple sources.…”
Section: Future Research
Citation type: mentioning (confidence: 99%)