“…When relevant, we computed the base rate of failure (BR Fail ; i.e., the percent of the sample that failed a given cutoff). The prevalence of the condition of interest (in this context, BR Fail ) is a descriptive statistic that is important for understanding classification accuracy in general ( Wald & Bestwick, 2014 ) and in the context of performance validity assessment specifically ( Abeare, Messa, et al., 2019 ). Although area under the curve (AUC) is useful for comparing overall classification accuracy across models ( Altman & Bland, 1994 ; Fawcett, 2006 ; Marzban, 2004 ), its clinical relevance has been called into question ( Hand, 2009 ; Lobo et al., 2008 ; Wald & Bestwick, 2014 ).…”