Instructors in higher education frequently employ examinations composed of problem-solving questions to assess student knowledge and learning. But are student scores on these tests reliable? Surprisingly few researchers have examined this question empirically, arguably because of perceived limitations in traditional research methods. Furthermore, many believe multiple-choice exams to be a more objective and reliable form of testing students than any other type. We question this widespread belief. In a series of empirical studies in 8 classes (401 students) in a finance course, we examined these questions using a methodology based on three key elements: a true experimental design, a more appropriate estimator of exam score reliability, and reliability confidence intervals. Internal consistency reliabilities of problem-solving test scores were consistently high (all > .87, median = .90) across different classes, students, examiners, and exams. In contrast, multiple-choice test scores were less reliable (all < .69). Recommendations are presented for improving the construction of exams in higher education.
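
To make the reliability terminology concrete, the sketch below illustrates one common internal consistency estimator, Cronbach's alpha, together with a percentile bootstrap confidence interval. This is not the authors' code or necessarily the specific estimator used in the studies (the abstract only says a "more appropriate estimation" was used); the data, variable names, and sample sizes are hypothetical and for illustration only.

```python
# Illustrative sketch (not the paper's method): Cronbach's alpha for
# item-level exam scores, with a percentile bootstrap confidence interval.
# All data and names below are hypothetical.
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """items: 2-D array, rows = students, columns = exam items (points scored)."""
    k = items.shape[1]                          # number of items
    item_vars = items.var(axis=0, ddof=1)       # variance of each item
    total_var = items.sum(axis=1).var(ddof=1)   # variance of total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

def bootstrap_alpha_ci(items: np.ndarray, n_boot: int = 2000,
                       level: float = 0.95, seed: int = 0) -> tuple[float, float]:
    """Percentile bootstrap CI for alpha, resampling students with replacement."""
    rng = np.random.default_rng(seed)
    n = items.shape[0]
    stats = [cronbach_alpha(items[rng.integers(0, n, n)]) for _ in range(n_boot)]
    lo = np.percentile(stats, 100 * (1 - level) / 2)
    hi = np.percentile(stats, 100 * (1 + level) / 2)
    return lo, hi

# Hypothetical example: 30 students, 5 problem-solving items
rng = np.random.default_rng(1)
ability = rng.normal(size=(30, 1))
scores = ability + rng.normal(scale=0.5, size=(30, 5))   # items share a common factor
print(f"alpha = {cronbach_alpha(scores):.2f}, 95% CI = {bootstrap_alpha_ci(scores)}")
```

Reporting an interval rather than a point estimate is what the third methodological element refers to: it conveys how much the reliability estimate itself could vary across samples of students.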