Determining influence, interaction and causality of contrast and sequence effects in objective structured clinical exams

Yeates, Peter; Moult, Alice; Cope, Natalie; McCray, Gareth; Fuller, Richard; McKinley, Robert K

doi:10.1111/medu.14713

Cited by 9 publications

(2 citation statements)

References 47 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…All simulation is limited by the parameters of the simulation. In this study, we modelled all known substantial in uences on OSCE scores (candidate, station, examiner, and appropriate random variance terms)(18, 19,34), but omitted in uences shown more recently to be minor such as contrast effects or differential rater function over time (35). Importantly, we can't comment on combinations of parameters which we didn't test (for example 60% examiner participation, 3 linking videos or 12% baseline difference) nor can we infer beyond the range of modelled parameters (i.e.…”

Section: Limitationsmentioning

confidence: 99%

VESCA’s variable precision: Determining the accuracy of adjustment for examiner differences in distributed OSCEs

Yeates

McCray

2023

Preprint

Self Cite

View full text Add to dashboard Cite

Introduction: Ensuring examiner equivalence across assessment locations is a priority within distributed Objective Structured Clinical Exams (OSCEs) but is challenging due to lack of overlap in performances judged by different groups of examiners. Yeates et al have develop a methodology (Video-based Examiner Score Comparison and Adjustment (VESCA)) to compare and (potentially) adjust for the influence of different groups of examiners within OSCEs. Whilst initial research has been promising, the accuracy of the adjusted scores produced by VESCA is unknown. As this is critical to VESCA’s utility, we aimed to investigate the accuracy of adjusted scores produced by VESCA under a range of plausible operational parameters. Methods: using statistical simulation, we investigated how: 1/proportion of participating examiners, 2/ number of linking videos, 3/baseline differences in examiner stringency between schools, 4/number of OSCE stations and 5/different degrees of random error within examiners’ judgements influenced accuracy of adjusted scores. We generated distributions of students’ “true” performances across several stations, added examiner error, and simulated linking through crossed video-scoring, before using Many Facet Rasch Modelling to produce adjusted scores, replicating 1000 times for each permutation, to determine average error reduction and the proportion of students whose scores became more accurate. Results: Under all conditions where no baseline difference existed between groups of examiners (i.e. random rather than systematic variance), score adjustment minimally improved or worsened score accuracy. Conversely, as modelled (systematic) baseline differences between schools increased, adjustment accuracy increased, reducing error by up to 71% and making scores more accurate for up to 93% of students in the 20% baseline-difference condition. Conclusions: score adjustment through VESCA will substantially enhance equivalence for candidates in distributed OSCEs when 10–20% baseline differences exist between examiners in different schools. As such differences are plausible in practice, consideration should be given to use of VESCA in large scale/national exams.

show abstract

Section: Limitationsmentioning

confidence: 99%

VESCA’s variable precision: Determining the accuracy of adjustment for examiner differences in distributed OSCEs

Yeates

McCray

2023

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Their findings suggest that despite following accepted procedures for OSCE conduct, significant differences may persist between groups of examiners which could affect the pass/fail classification of a significant minority of students. Follow-up work has enhanced the technique’s feasibility, 24 and shown that it is adequately robust to several potential confounding influences 25 and variations in implementation. 26 While these findings suggest that examiner-cohort effects are important and support the validity of VESCA for their measurement, VESCA has not yet been used across institutions, so both the likely magnitude of effects which may arise, and the practical implications of applying the method across institutions are unknown.…”

Section: Introductionmentioning

confidence: 99%

Enhancing authenticity, diagnosticity andequivalence (AD-Equiv) in multicentre OSCE exams in health professionals education: protocol for a complex intervention study

Yeates

Maluf²,

Kinston³

et al. 2022

BMJ Open

Self Cite

View full text Add to dashboard Cite

IntroductionObjective structured clinical exams (OSCEs) are a cornerstone of assessing the competence of trainee healthcare professionals, but have been criticised for (1) lacking authenticity, (2) variability in examiners’ judgements which can challenge assessment equivalence and (3) for limited diagnosticity of trainees’ focal strengths and weaknesses. In response, this study aims to investigate whether (1) sharing integrated-task OSCE stations across institutions can increase perceived authenticity, while (2) enhancing assessment equivalence by enabling comparison of the standard of examiners’ judgements between institutions using a novel methodology (video-based score comparison and adjustment (VESCA)) and (3) exploring the potential to develop more diagnostic signals from data on students’ performances.Methods and analysisThe study will use a complex intervention design, developing, implementing and sharing an integrated-task (research) OSCE across four UK medical schools. It will use VESCA to compare examiner scoring differences between groups of examiners and different sites, while studying how, why and for whom the shared OSCE and VESCA operate across participating schools. Quantitative analysis will use Many Facet Rasch Modelling to compare the influence of different examiners groups and sites on students’ scores, while the operation of the two interventions (shared integrated task OSCEs; VESCA) will be studied through the theory-driven method of Realist evaluation. Further exploratory analyses will examine diagnostic performance signals within data.Ethics and disseminationThe study will be extra to usual course requirements and all participation will be voluntary. We will uphold principles of informed consent, the right to withdraw, confidentiality with pseudonymity and strict data security. The study has received ethical approval from Keele University Research Ethics Committee. Findings will be academically published and will contribute to good practice guidance on (1) the use of VESCA and (2) sharing and use of integrated-task OSCE stations.

show abstract

Technology enhanced assessment: Ottawa consensus statement and recommendations

et al. 2022

View full text Add to dashboard Cite

Determining influence, interaction and causality of contrast and sequence effects in objective structured clinical exams

Cited by 9 publications

References 47 publications

VESCA’s variable precision: Determining the accuracy of adjustment for examiner differences in distributed OSCEs

VESCA’s variable precision: Determining the accuracy of adjustment for examiner differences in distributed OSCEs

Enhancing authenticity, diagnosticity andequivalence (AD-Equiv) in multicentre OSCE exams in health professionals education: protocol for a complex intervention study

Technology enhanced assessment: Ottawa consensus statement and recommendations

Contact Info

Product

Resources

About