This meta-analysis reviewed the magnitude and moderators of the relationship between rater liking and performance ratings. The results revealed substantial overlap between rater liking and performance ratings (ρ = .77). Although this relationship is often interpreted as indicative of bias, we review studies indicating that the relationship between liking and performance ratings may, to some extent, reflect "true" differences in ratee performance. Moderator analyses indicated that the relationship between liking and performance ratings was weaker for ratings of organizational citizenship behaviors, ratings made by peer raters, ratings in nonsales jobs, and ratings made for developmental purposes; however, the relationship remained strong across moderator levels, underscoring its robustness. Implications for the interpretation of performance ratings are discussed.

Performance evaluation systems are central to a cross-section of talent management functions, such as determining employee compensation and rewards, providing developmental feedback, documenting administrative decisions, planning for succession, and reinforcing organizational norms (Cascio & Aguinis, 2005). In fact, Ghorpade and Chen (1995) suggested that performance ratings are "inevitable in all organizations-large and small, public and private, local and multinational" (p. 32). Yet performance appraisals have been the subject of substantial criticism over the years. Indeed, skepticism as to the quality of the information obtained from human evaluations has persisted for nearly as long as the field of psychological measurement (Thorndike, 1925; Wells, 1907). Murphy (2008) succinctly summed up the state of affairs, noting that "performance ratings are widely viewed as poor measures of job performance" (p. 148).

Over the years, a litany of factors has been proposed as hindrances to the quality of performance ratings. The overarching theme of this school of thought is that raters introduce performance-irrelevant variance into performance ratings because they are either unable or unwilling to provide accurate ratings. Early research attributed low-quality ratings to rater ability (or, presumably, lack thereof) and sought to design better scales