2022
DOI: 10.1007/s10618-022-00861-0
|View full text |Cite
|
Sign up to set email alerts
|

An external stability audit framework to test the validity of personality prediction in AI hiring

Abstract: Automated hiring systems are among the fastest-developing of all high-stakes AI systems. Among these are algorithmic personality tests that use insights from psychometric testing, and promise to surface personality traits indicative of future success based on job seekers’ resumes or social media profiles. We interrogate the validity of such systems using stability of the outputs they produce, noting that reliability is a necessary, but not a sufficient, condition for validity. Crucially, rather than challengin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 52 publications
0
2
0
Order By: Relevance
“…Others have developed auditing procedures for more targeted purposes, e.g., to test the accuracy of personality prediction in AI systems used for recruitment (Rhea et al, 2022), evaluating the capabilities of language models (Goel et al, 2021;Mökander et al, 2023), providing explanations for black-box AI systems (Pedreschi et al, 2018), and conducting audits of clinical decision support systems (Panigutti et al, 2021). Again, what links all these procedures is that they audit AI systems against predefined technical, functionality and reliability standards.…”
Section: Technical Legal and Ethics-based Approachesmentioning
confidence: 99%
“…Others have developed auditing procedures for more targeted purposes, e.g., to test the accuracy of personality prediction in AI systems used for recruitment (Rhea et al, 2022), evaluating the capabilities of language models (Goel et al, 2021;Mökander et al, 2023), providing explanations for black-box AI systems (Pedreschi et al, 2018), and conducting audits of clinical decision support systems (Panigutti et al, 2021). Again, what links all these procedures is that they audit AI systems against predefined technical, functionality and reliability standards.…”
Section: Technical Legal and Ethics-based Approachesmentioning
confidence: 99%
“…Recent research has examined the accuracy of personality prediction in AI-based hiring systems and found that certain tools demonstrate significant instability in measuring key facets. Consequently, these tools cannot be considered valid assessment instruments ( Rhea et al, 2022 ). However, it is still uncertain whether alternative personality-based hiring algorithms, designed to predict job performance based on personality facets, could potentially result in adverse impacts or biases.…”
Section: Algorithms As a Solution Against Biasmentioning
confidence: 99%
“…It is certainly true that the types of stimuli, measurements, and the technical methods of mapping those measurements to knowledge in the form of assessment scores can be very different when comparing traditional assessments and AI-based assessments (Liem et al 2018). To be clear, here we want to focus on the underlying epistemological and ontological assumptions of the psychometric paradigm, which are shared by both traditional and algorithmic hiring assessments (Rhea et al 2022). This shared set of underlying assumptions implies that their merits and limitations-which have been explored in academic literature over multiple decades-also apply to algorithmic hiring assessments.…”
Section: What Are the Assumptions?mentioning
confidence: 99%
“…Some industrial and organizational (I-O) psychologists have expressed concerns about the scarcity of available evidence supporting the validity, reliability, and fairness of these tools (Gonzalez et al 2019;Tippins et al 2021). In fact, a recent study auditing two personalityassessing algorithms used in hiring concluded that both tools failed to exhibit sufficient reliability and therefore cannot be considered as valid assessments (Rhea et al 2022). Furthermore, some algorithmic assessments measure features for which there is no established and scrutinized theory relating them to job seeker attributes or job performance (e.g., features like tone of voice and facial expressions) (Ajunwa 2021;Hinkle 2021;Stark and Hutson 2021;Tippins et al 2021).…”
Section: Introductionmentioning
confidence: 99%