“…To our knowledge, the idea of using log data from an EAP to analyse exam items was first introduced in Neel's 1999 work, presented at the Annual Meeting of the AERA (cited in Jung Kim, 2001). To date, exam logs have mostly been used for measuring and modelling exam‐takers' accuracy, speed, revisits and effort (Bezirhan et al., 2021; Klein Entink et al., 2008; Sharma et al., 2020; Wise, 2015; Wise & Gao, 2017); analysing answering and revising behaviour during exams (Costagliola et al., 2008; Pagni et al., 2017); examining and enhancing metacognitive regulation of strategy use and cognitive processing (Dodonova & Dodonov, 2012; Goldhammer et al., 2014; Papamitsiou & Economides, 2015; Thillmann et al., 2013); classifying exam‐takers for the personalisation of testing services (Papamitsiou & Economides, 2017); validating the interpretation of test scores (Engelhardt & Goldhammer, 2019; Kane & Mislevy, 2017; Kong et al., 2007; Padilla & Benítez, 2014; Toton & Maynes, 2019; van der Linden & Guo, 2008); understanding exam‐takers' performance (Greiff et al., 2016; Kupiainen et al., 2014; Papamitsiou et al., 2014, 2018; Papamitsiou & Economides, 2013, 2014); enhancing item selection in adaptive testing environments (van der Linden, 2008); analysing exam items (Costagliola et al., 2008; Jung Kim, 2001); detecting cheating (Cleophas et al., 2021; Costagliola et al., 2008); and identifying test‐taking strategies (Costagliola et al., 2008). Nonetheless, most previous work has focused on time‐based behaviours and the interpretation of exam‐taker results; few studies have examined the potential of using exam‐taker behaviours to validate or enrich the interpretation of the quality of exam items.…”