In multidimensional forced-choice (MFC) questionnaires, items measuring different attributes are presented in blocks, and participants have to rank-order the items within each block (fully or partially). Such comparative formats can reduce the impact of numerous response biases that often affect single-stimulus items (also known as rating or Likert scales). However, if scored with traditional methodology, MFC instruments produce ipsative data, whereby all individuals have a common total test score. Ipsative scoring distorts individual profiles (it is impossible to achieve all high or all low scale scores), construct validity (covariances between scales must sum to zero), criterion-related validity (validity coefficients must sum to zero), and reliability estimates; a brief derivation of these sum-to-zero constraints is sketched at the end of this section. We argue that these problems are caused by inadequate scoring of forced-choice items, and we advocate the use of item response theory (IRT) models based on an appropriate response process for comparative data, such as Thurstone's Law of Comparative Judgment. We show that, by applying Thurstonian IRT modeling (Brown & Maydeu-Olivares, 2011), even existing forced-choice questionnaires with challenging features can be scored adequately, and that the IRT-estimated scores are free from the problems of ipsative data.

Assessments of personality, social attitudes, interests, motivation, psychopathology, and well-being rely largely on respondent-reported measures. Most such measures employ the so-called single-stimulus format, where respondents evaluate one question (or item) at a time, often in relation to a rating scale (i.e., Likert-type items). Because the respondents rate each item separately from other items, they make absolute judgments about the extent to which the item describes their personality, attitudes, and so on. Although simple to answer and score, and therefore popular with test takers and test users, the single-stimulus format makes several assumptions about respondents' rating behaviors that are often unrealistic. For instance, the use of rating scales relies on the assumption that respondents interpret category labels in the same way. This assumption is rarely tested in practice, but the research available on the issue suggests that the interpretation and meaning of response categories vary from one respondent to another (Friedman & Amoo, 1999). Furthermore, individual response styles vary (Van Herk, Poortinga, & Verhallen, 2004), so that some respondents avoid extreme categories (central tendency responding), whereas others prefer them (extreme responding). Sometimes respondents tend to agree with both positive and negative statements as presented (acquiescence bias).

Another common problem is getting respondents to differentiate between the ratings they give to single-stimulus items. When rating another person's attributes or behavior (as in 360-degree feedback), respondents commonly give either high or low ratings on all behaviors (the halo/horn effect), depending on whether they judge the person to be high or low on a single important dimension. Typically, respon...
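To make the sum-to-zero constraints mentioned in the abstract concrete, here is a minimal derivation under fully ipsative scoring; the notation ($s_{ik}$ for person $i$'s score on scale $k$ of $K$ scales, constant $c$, criterion $y$) is illustrative rather than taken from any particular instrument. Because every ranking block allocates the same fixed set of rank points, each person's scores sum to the same constant:

\[
\sum_{k=1}^{K} s_{ik} = c \quad \text{for every person } i.
\]

Consequently, for any scale $j$,

\[
\sum_{k=1}^{K} \operatorname{Cov}(s_j, s_k) = \operatorname{Cov}\!\Big(s_j, \sum_{k=1}^{K} s_k\Big) = \operatorname{Cov}(s_j, c) = 0,
\]

so the covariances of scale $j$ with the remaining scales must sum to $-\operatorname{Var}(s_j)$, forcing spurious negative covariances regardless of the true relations among the attributes. Likewise, for any external criterion $y$,

\[
\sum_{k=1}^{K} \operatorname{Cov}(s_k, y) = \operatorname{Cov}(c, y) = 0,
\]

so covariance-metric validity coefficients sum exactly to zero (correlation-metric coefficients do so only approximately, when scale variances are unequal).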