Within the context of CAT, discrepancies between estimated and true item difficulty are attributed to four main sources: (1) random error due to sampling of test takers, that is, standard error (SE), which tends to be larger in CAT because of higher item turnover and smaller calibration samples; (2) differences across person groups, that is, differential item functioning (DIF); (3) testlet effects, a type of context effect that can arise with item sets sharing a common stimulus; and (4) automatic item generation or cloning, where variability in item parameters may be ignored for items generated from the same template. In a series of CAT simulation studies, Doebler () manipulated the first and fourth of these sources; the results showed varying amounts of person parameter bias across IRT models, estimators, test lengths, and item pool sizes. When the SE of item difficulty was simulated at .25, mean bias in person ability exceeded .50 logits under some conditions.
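The mechanism behind the first source can be illustrated with a minimal simulation sketch. This is not Doebler's actual design: it is a hypothetical Rasch-model CAT in which item selection and scoring use difficulty estimates contaminated with normally distributed calibration error (SD = the simulated SE), while item responses are generated from the true difficulties. All function names, the EAP grid, and the pool and test-length settings here are illustrative assumptions.

```python
import random
import math

random.seed(7)

# 81-point quadrature grid on [-4, 4] with a standard normal prior.
GRID = [-4.0 + 8.0 * g / 80 for g in range(81)]
PRIOR = [math.exp(-0.5 * t * t) for t in GRID]

def p(theta, b):
    # Rasch probability of a correct response.
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def eap_theta(responses, difficulties):
    # Expected a posteriori ability estimate under the N(0, 1) prior,
    # scored with the supplied (possibly error-contaminated) difficulties.
    post = PRIOR[:]
    for x, b in zip(responses, difficulties):
        for i, t in enumerate(GRID):
            q = p(t, b)
            post[i] *= q if x else (1.0 - q)
    z = sum(post)
    return sum(t * w for t, w in zip(GRID, post)) / z

def run_cat(theta_true, b_true, b_hat, test_len=20):
    used, resp, bs = set(), [], []
    theta = 0.0
    for _ in range(test_len):
        # Select the unused item whose ESTIMATED difficulty is closest to
        # the current ability estimate (maximum Rasch information); this is
        # where selection can capitalize on calibration error.
        j = min((i for i in range(len(b_hat)) if i not in used),
                key=lambda i: abs(b_hat[i] - theta))
        used.add(j)
        # Responses are generated from the TRUE difficulty.
        resp.append(1 if random.random() < p(theta_true, b_true[j]) else 0)
        bs.append(b_hat[j])  # scoring uses the contaminated values
        theta = eap_theta(resp, bs)
    return theta

def mean_bias(se_b, pool=200, persons=300, theta_true=0.0):
    # Mean (theta_hat - theta_true) over replications, with fresh
    # N(0, se_b) calibration error drawn for each simulee's pool.
    b_true = [random.gauss(0.0, 1.0) for _ in range(pool)]
    total = 0.0
    for _ in range(persons):
        b_hat = [b + random.gauss(0.0, se_b) for b in b_true]
        total += run_cat(theta_true, b_true, b_hat) - theta_true
    return total / persons

print(round(mean_bias(0.25), 3))
```

Keeping separate `b_true` (response generation) and `b_hat` (selection and scoring) arrays is the essential design choice: it lets the simulated SE enter exactly where it does operationally, through item selection and scoring, while the data-generating model stays fixed.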