2002 Annual Conference Proceedings
DOI: 10.18260/1-2--10943

Rubric Development And Inter Rater Reliability Issues In Assessing Learning Outcomes

Abstract: This paper describes the development of rubrics that help evaluate student performance and relate that performance directly to the educational objectives of the program. Issues in accounting for different constituencies, selecting items for evaluation, and minimizing time required for data analysis are discussed. Aspects of testing the rubrics for consistency between different faculty raters are presented, as well as a specific example of how inconsistencies were addressed. Finally, a consideration of the dif…

Cited by 24 publications (18 citation statements) | References 6 publications
“…Chance agreement given four scoring levels and three graders would be .34 and .06 by the liberal and conservative definitions, respectively. Newell et al (2002) found comparable levels of agreement for three graders using a rubric for grading students' solutions of chemical engineering problems, a task that was not writing based. The rubric developed by Newell et al also had four scoring levels within each dimension.…”
Section: Results (mentioning)
confidence: 90%
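The .34 and .06 chance-agreement figures quoted above can be checked by brute-force enumeration. A minimal sketch, assuming four equally likely integer levels (coded 0-3) and three independent graders; "liberal" means all three scores fall within 1 point of one another and "conservative" means exact agreement, following the quoted definitions:

```python
from itertools import product

LEVELS = range(4)  # four scoring levels, assumed coded 0-3
GRADERS = 3        # three independent graders

triples = list(product(LEVELS, repeat=GRADERS))  # all 64 score combinations

# Liberal definition: all three scores within 1 point of one another.
liberal = sum(max(t) - min(t) <= 1 for t in triples) / len(triples)

# Conservative definition: all three scores identical.
conservative = sum(len(set(t)) == 1 for t in triples) / len(triples)

print(f"liberal chance agreement:      {liberal:.2f}")       # 0.34
print(f"conservative chance agreement: {conservative:.2f}")  # 0.06
```

The enumeration gives 22/64 ≈ .34 for the liberal definition and 4/64 ≈ .06 for the conservative one, matching the values reported in the quote.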
“…Agreement was defined liberally as all scores assigned for a dimension by the three graders being within 1 point of one another. These criteria are accepted in the measurement literature (Tinsley & Weiss, 2000) and have been applied in past studies of interrater agreement for grading rubrics (Newell et al, 2002). Agreement on total overall score out of 24 possible points (8 dimensions × 3 points maximum for each) for the 40 papers was also calculated and is described in the results.…”
Section: Evaluating Interrater Agreement (mentioning)
confidence: 99%
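A sketch of the agreement computation described in the quoted passage, assuming a score array of shape (papers, dimensions, graders) with integer levels 0-3. The data here are random and purely illustrative, and applying the same within-1-point band to the 24-point totals is an assumption for illustration; the quoted passage does not specify the criterion used for totals:

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical scores: 40 papers x 8 dimensions x 3 graders, levels 0-3.
scores = rng.integers(0, 4, size=(40, 8, 3))

# Liberal agreement per dimension: the three graders' scores span <= 1 point.
spread = scores.max(axis=2) - scores.min(axis=2)  # shape (40, 8)
dimension_agreement = (spread <= 1).mean()

# Total score per grader: sum over the 8 dimensions (24 points maximum).
totals = scores.sum(axis=1)  # shape (40, 3)
total_spread = totals.max(axis=1) - totals.min(axis=1)
total_agreement = (total_spread <= 1).mean()  # within-1-point band, assumed

print(f"per-dimension agreement rate: {dimension_agreement:.2f}")
print(f"total-score agreement rate:   {total_agreement:.2f}")
```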
“…For example, Stellmack, Konheim-Kalkstein, Manor, Massey, and Schmitz (2009) found low interrater agreement (agreement between reviewers in .37 of the scores they assigned) for graders who developed and refined a grading rubric over several months. Newell, Dahm, and Newell (2002) also reported comparably low interrater agreement (.47 proportion of agreement measured in the same way as Stellmack, Konheim-Kalkstein, Manor, Massey, & Schmitz, 2009) in the grading of student writing with a rubric.¹ Indeed, subjectivity and low interrater agreement in evaluating scientific writing are implicitly acknowledged in the peer-review process when an editor seeks reviews from multiple reviewers.…”
(mentioning)
confidence: 93%
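The .37 and .47 figures above are simple proportions of matching scores between graders. A minimal illustrative helper, with hypothetical data not drawn from either study:

```python
def proportion_agreement(scores_a, scores_b):
    """Fraction of items on which two graders assigned the same score."""
    if len(scores_a) != len(scores_b):
        raise ValueError("graders must score the same items")
    matches = sum(a == b for a, b in zip(scores_a, scores_b))
    return matches / len(scores_a)

# Hypothetical scores from two graders over ten papers (levels 0-3).
grader_1 = [3, 2, 2, 1, 0, 3, 2, 1, 1, 2]
grader_2 = [3, 1, 2, 1, 1, 3, 2, 0, 1, 2]
print(proportion_agreement(grader_1, grader_2))  # 0.7
```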
“…A decision was made to have a four-level scale for the rubric, which is consistent with other university-wide holistic rubrics & minimizes the tendency to rate in the middle of odd number level scales. 33,38 Listing includes a description of the various performance levels that are used to write the dimension descriptions. 31 Engineering faculty reviewed the Paul-Elder critical thinking framework to identify key Elements of Thought and Universal Intellectual Standards that would be applicable across engineering courses.…”
Section: Development and Initial Validation of a Holistic Engineering Critical Thinking Rubric (mentioning)
confidence: 99%