Measuring Patient-Reported Outcomes Adaptively: Multidimensionality Matters!

Paap, Muirne C. S.; Kroeze, Karel A.; Glas, Cornelis A.W.; Terwee, Caroline B.; Palen, Job van der; Veldkamp, Bernard P.

doi:10.1177/0146621617733954

Cited by 10 publications

(8 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The first approach we propose leaves the traditional item-selection criterion intact, but dynamically restricts the remaining item pool from which items can be selected. This can be done in two ways, which we will label “hard restriction” and “soft restriction.” Hard restriction implies that, once the precision threshold δ has been met for dimension q , the remaining items loading on that dimension can no longer be selected throughout the CAT, which virtually results in a new item pool S k * (see, for example, Paap et al, 2018; Yao, 2013). Under soft restriction, only items can be selected that have a non-zero loading a id on at least one dimension that does not yet meet the precision threshold δ at the given iteration during the CAT administration (see, for example, Paap et al, 2019).…”

Section: Refining Item-selection Rules For Fixed-precision Mcatmentioning

confidence: 99%

“…Under hard restriction, the item pool restriction is permanent and not reversible, because it assumes an implied monotonic decrease of the marginal SE s during the CAT administration that is mathematically not strictly guaranteed. Note that both variants have been applied in specific studies as ad hoc solutions (see, for example, Paap et al, 2018; Yao, 2013), but their performance has not been formally evaluated or compared with other approaches.…”

Section: Refining Item-selection Rules For Fixed-precision Mcatmentioning

confidence: 99%

“…An empirical multidimensional item bank of I = 194 items (4 or 5 ordinal response categories), designed to measure different aspects of quality of life (Paap et al, 2018), was used as a basis for the second study. Four dimensions were measured: fatigue (50 items), disease-specific complaints (46 items), physical function (63 items), and social roles and activities (35 items).…”

Section: Study Ii: a Real Item Bank Examplementioning

confidence: 99%

See 2 more Smart Citations

Making Fixed-Precision Between-Item Multidimensional Computerized Adaptive Tests Even Shorter by Reducing the Asymmetry Between Selection and Stopping Rules

Braeken

Paap

2020

Applied Psychological Measurement

Self Cite

View full text Add to dashboard Cite

Fixed-precision between-item multidimensional computerized adaptive tests (MCATs) are becoming increasingly popular. The current generation of item-selection rules used in these types of MCATs typically optimize a single-valued objective criterion for multivariate precision (e.g., Fisher information volume). In contrast, when all dimensions are of interest, the stopping rule is typically defined in terms of a required fixed marginal precision per dimension. This asymmetry between multivariate precision for selection and marginal precision for stopping, which is not present in unidimensional computerized adaptive tests, has received little attention thus far. In this article, we will discuss this selection-stopping asymmetry and its consequences, and introduce and evaluate three alternative item-selection approaches. These alternatives are computationally inexpensive, easy to communicate and implement, and result in effective fixed-marginal-precision MCATs that are shorter in test length than with the current generation of item-selection approaches.

show abstract

Section: Refining Item-selection Rules For Fixed-precision Mcatmentioning

confidence: 99%

Section: Refining Item-selection Rules For Fixed-precision Mcatmentioning

confidence: 99%

Section: Study Ii: a Real Item Bank Examplementioning

confidence: 99%

See 1 more Smart Citation

Making Fixed-Precision Between-Item Multidimensional Computerized Adaptive Tests Even Shorter by Reducing the Asymmetry Between Selection and Stopping Rules

Braeken

Paap

2020

Applied Psychological Measurement

Self Cite

View full text Add to dashboard Cite

show abstract

“…La experiencia de algunos investigadores ha resultado prometedora, demostrando una ganancia incremental con respecto a los TAIs unidimensionales en la eficiencia de las mediciones (e.g. Makransky et al, 2013;Paap et al, 2017). No obstante, todavía se discuten aspectos vinculados a la determinación del algoritmo adaptativo como el método de selección de ítems (Smits, Paap, & Böhnke, 2018;Tu, Han, Cai & Gao, 2018) y criterios de interrupción (Wang, Chang, & Boughton, 2013;Yao, 2013).…”

Section: Comentariosunclassified

Construcción de un banco de ítems de facetas de neuroticismo para el desarrollo de un test adaptativo

2019

View full text Add to dashboard Cite

El objetivo de este trabajo fue elaborar un banco de ítems para medir las facetas del Neuroticismo basado en el Modelo de los Cinco Factores (McCrae & Costa, 2010) y examinar la viabilidad de una administración adaptativa. Se inició con un pool de 90 ítems, que fue reducido a 54 (nueve por faceta) por juicio experto y estudio piloto. La versión depurada se administró a 1147 adultos de población general del área metropolitana de Buenos Aires (52.7% mujeres). Un 70% de los casos se usó para: a) calibrar los ítems de cada faceta por separado con el Modelo de Respuesta Graduada de Samejima (2010), b) estudiar la estructura interna del banco con un Análisis Factorial Confirmatorio y c) obtener evidencias de validez concurrente. El alfa ordinal de las facetas osciló entre .76 y .87. Con el 30% restante de casos se efectuó una simulación de administración adaptativa con criterio de parada de error ≤ 0.50. Se concluye que el banco reúne evidencias de validez y confiabilidad aceptables para su administración en un formato convencional, pero se necesita incorporar más ítems si se pretende optimizar la medición de las facetas Impulsividad, Vulnerabilidad y Hostilidad. Palabras clave: neuroticismo, modelo de lo cinco factores, banco de ítems, test adaptativo informatizado, teoría de respuesta al ítem. Constructing a bank of neuroticism facet items for the development of an adaptive test

show abstract

“…These studies showed that multidimensional CATs were 25–33% shorter [ 41 – 43 ]. Two recent studies [ 44 , 45 ] in the context of health measurement showed that the efficiency gains reported for achievement measurement seem to generalize to health measurement: between-item multidimensional CATs were on average 20–38% shorter compared to using separate unidimensional CATs when between-dimension correlations were high ( r > .76). For weaker correlations ( r = .56), multidimensional CATs were on average 17% shorter than unidimensional CATs [ 45 ].…”

Section: Efficiency and Precision Of An Item Bankmentioning

confidence: 99%

Some recommendations for developing multidimensional computerized adaptive tests for patient-reported outcomes

2018

Self Cite

View full text Add to dashboard Cite

PurposeMultidimensional item response theory and computerized adaptive testing (CAT) are increasingly used in mental health, quality of life (QoL), and patient-reported outcome measurement. Although multidimensional assessment techniques hold promises, they are more challenging in their application than unidimensional ones. The authors comment on minimal standards when developing multidimensional CATs.MethodsPrompted by pioneering papers published in QLR, the authors reflect on existing guidance and discussions from different psychometric communities, including guidelines developed for unidimensional CATs in the PROMIS project.ResultsThe commentary focuses on two key topics: (1) the design, evaluation, and calibration of multidimensional item banks and (2) how to study the efficiency and precision of a multidimensional item bank. The authors suggest that the development of a carefully designed and calibrated item bank encompasses a construction phase and a psychometric phase. With respect to efficiency and precision, item banks should be large enough to provide adequate precision over the full range of the latent constructs. Therefore CAT performance should be studied as a function of the latent constructs and with reference to relevant benchmarks. Solutions are also suggested for simulation studies using real data, which often result in too optimistic evaluations of an item bank’s efficiency and precision.DiscussionMultidimensional CAT applications are promising but complex statistical assessment tools which necessitate detailed theoretical frameworks and methodological scrutiny when testing their appropriateness for practical applications. The authors advise researchers to evaluate item banks with a broad set of methods, describe their choices in detail, and substantiate their approach for validation.

show abstract

Measuring Patient-Reported Outcomes Adaptively: Multidimensionality Matters!

Cited by 10 publications

References 26 publications

Making Fixed-Precision Between-Item Multidimensional Computerized Adaptive Tests Even Shorter by Reducing the Asymmetry Between Selection and Stopping Rules

Making Fixed-Precision Between-Item Multidimensional Computerized Adaptive Tests Even Shorter by Reducing the Asymmetry Between Selection and Stopping Rules

Construcción de un banco de ítems de facetas de neuroticismo para el desarrollo de un test adaptativo

Some recommendations for developing multidimensional computerized adaptive tests for patient-reported outcomes

Contact Info

Product

Resources

About