Sample size and precision of estimates in studies of depression screening tool accuracy: A meta‐research review of studies published in 2018–2021

Nassar, Elsa-Lynn; Levis, Brooke; Neyer, Marieke A.; Rice, Danielle B; Booij, Linda; Benedetti, Andrea; Thombs, Brett D.

doi:10.1002/mpr.1910

Cited by 4 publications

(5 citation statements)

References 42 publications

(55 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, we found that 72% of included studies did not specify the targeted sample size including how it was calculated. Similar trends were observed in reviews of depression screening tool accuracy studies published between (1) and 36 (34%) provided reasonably accurate confidence intervals (Nassar et al, 2022a). Importantly, accuracy studies with small samples sizes often fail to identify the most accurate cut-off and overstate accuracy estimates for the cut-offs they report (Bhandari et al, 2021).…”

Section: Discussionsupporting

confidence: 66%

“…However, we found that 72% of included studies did not specify the targeted sample size including how it was calculated. Similar trends were observed in reviews of depression screening tool accuracy studies published between (1) 2013 and 2015, where only three of 89 (3%) studies described a viable sample size calculation and 30 studies (34%) provided reasonably accurate confidence intervals around accuracy estimates (Thombs & Rice, 2016 ) and (2) 2018 and 2021, where only 12 of 106 (11%) studies described a viable sample size calculation and 36 (34%) provided reasonably accurate confidence intervals (Nassar et al., 2022a ). Importantly, accuracy studies with small samples sizes often fail to identify the most accurate cut‐off and overstate accuracy estimates for the cut‐offs they report (Bhandari et al., 2021 ).…”

Section: Discussionmentioning

confidence: 99%

“…This was part of a series of three meta‐research reviews that evaluated recently published studies of depression screening tool accuracy. The other two reviews examined the reporting of sample size calculations and precision of accuracy estimates (Nassar et al., 2022a ) and the characteristics of participants included in studies (Nassar et al., 2022b ). Prior to initiating the present study, a study protocol was posted on the Open Science Framework ( https://osf.io/5tvf3/ ).…”

Section: Methodsmentioning

confidence: 99%

See 2 more Smart Citations

Transparency and completeness of reporting of depression screening tool accuracy studies: A meta‐research review of adherence to the Standards for Reporting of Diagnostic Accuracy Studies statement

Nassar

Levis

Neyer

et al. 2022

Int J Methods Psych Res

Self Cite

View full text Add to dashboard Cite

show abstract

“…However, we found that 72% of included studies did not specify the targeted sample size including how it was calculated. Similar trends were observed in reviews of depression screening tool accuracy studies published between (1) and 36 (34%) provided reasonably accurate confidence intervals (Nassar et al, 2022a). Importantly, accuracy studies with small samples sizes often fail to identify the most accurate cut-off and overstate accuracy estimates for the cut-offs they report (Bhandari et al, 2021).…”

Section: Discussionsupporting

confidence: 66%

“…However, we found that 72% of included studies did not specify the targeted sample size including how it was calculated. Similar trends were observed in reviews of depression screening tool accuracy studies published between (1) 2013 and 2015, where only three of 89 (3%) studies described a viable sample size calculation and 30 studies (34%) provided reasonably accurate confidence intervals around accuracy estimates (Thombs & Rice, 2016 ) and (2) 2018 and 2021, where only 12 of 106 (11%) studies described a viable sample size calculation and 36 (34%) provided reasonably accurate confidence intervals (Nassar et al., 2022a ). Importantly, accuracy studies with small samples sizes often fail to identify the most accurate cut‐off and overstate accuracy estimates for the cut‐offs they report (Bhandari et al., 2021 ).…”

Section: Discussionmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

Transparency and completeness of reporting of depression screening tool accuracy studies: A meta‐research review of adherence to the Standards for Reporting of Diagnostic Accuracy Studies statement

Nassar

Levis

Neyer

et al. 2022

Int J Methods Psych Res

Self Cite

View full text Add to dashboard Cite

show abstract

“…Any decision analysis to select cutoffs for use in future trials should rely upon evidence from large, high-quality meta-analyses. (Nassar et al, 2022). Furthermore, two-step methods, in which all participants with positive screens but only a proportion with negative screens are assessed with diagnostic interviews, could be used to obtain satisfactory precision for both sensitivity and specificity (Thombs et al, 2018).…”

Section: Discussionmentioning

confidence: 99%

“…If optimal cutoffs are identified, possible bias from data‐driven methods and imprecision of estimates should be noted, and these factors should be considered before author recommendations are made. As recommended by STARD, a priori sample size calculations should be conducted in depression screening tool accuracy studies to avoid drawing potentially misleading conclusions from overly small samples, but only approximately 10% of such studies report sample size calculations (Nassar et al., 2022). Furthermore, two‐step methods, in which all participants with positive screens but only a proportion with negative screens are assessed with diagnostic interviews, could be used to obtain satisfactory precision for both sensitivity and specificity (Thombs et al., 2018).…”

Section: Discussionmentioning

confidence: 99%

‘Optimal’ cutoff selection in studies of depression screening tool accuracy using the PHQ‐9, EPDS, or HADS‐D: A meta‐research study

Brehaut

Neupane

Levis

et al. 2022

Int J Methods Psych Res

Self Cite

View full text Add to dashboard Cite

Objectives: Optimal cutoff thresholds are selected to separate 'positive' from 'negative' screening results. We evaluated how depression screening tool studies select optimal cutoffs. Methods:We included studies from previously conducted meta-analyses of Patient Health Questionnaire-9, Edinburgh Postnatal Depression Scale, or Hospital Anxiety and Depression Scale-Depression accuracy. Outcomes included whether an optimal cutoff was selected, method used, recommendations made, and reporting guideline and protocol citation. Results:Of 212 included studies, 172 (81%) attempted to identify an optimal cutoff, and 147 of these 172 (85%) reported one or more methods. Methods were heterogeneous with Youden's J (N = 35, 23%) most common. Only 23 of 147 (16%) studies described a rationale for their method. Rationales focused on balancing sensitivity and specificity without describing why desirable. 131 of 172 studiesThis is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

show abstract

Data-Driven Cutoff Selection for the Patient Health Questionnaire-9 Depression Screening Tool

Levis,

Bhandari,

Neupane

et al. 2024

JAMA Netw Open

View full text Add to dashboard Cite

ImportanceTest accuracy studies often use small datasets to simultaneously select an optimal cutoff score that maximizes test accuracy and generate accuracy estimates.ObjectiveTo evaluate the degree to which using data-driven methods to simultaneously select an optimal Patient Health Questionnaire-9 (PHQ-9) cutoff score and estimate accuracy yields (1) optimal cutoff scores that differ from the population-level optimal cutoff score and (2) biased accuracy estimates.Design, Setting, and ParticipantsThis study used cross-sectional data from an existing individual participant data meta-analysis (IPDMA) database on PHQ-9 screening accuracy to represent a hypothetical population. Studies in the IPDMA database compared participant PHQ-9 scores with a major depression classification. From the IPDMA population, 1000 studies of 100, 200, 500, and 1000 participants each were resampled.Main Outcomes and MeasuresFor the full IPDMA population and each simulated study, an optimal cutoff score was selected by maximizing the Youden index. Accuracy estimates for optimal cutoff scores in simulated studies were compared with accuracy in the full population.ResultsThe IPDMA database included 100 primary studies with 44 503 participants (4541 [10%] cases of major depression). The population-level optimal cutoff score was 8 or higher. Optimal cutoff scores in simulated studies ranged from 2 or higher to 21 or higher in samples of 100 participants and 5 or higher to 11 or higher in samples of 1000 participants. The percentage of simulated studies that identified the true optimal cutoff score of 8 or higher was 17% for samples of 100 participants and 33% for samples of 1000 participants. Compared with estimates for a cutoff score of 8 or higher in the population, sensitivity was overestimated by 6.4 (95% CI, 5.7-7.1) percentage points in samples of 100 participants, 4.9 (95% CI, 4.3-5.5) percentage points in samples of 200 participants, 2.2 (95% CI, 1.8-2.6) percentage points in samples of 500 participants, and 1.8 (95% CI, 1.5-2.1) percentage points in samples of 1000 participants. Specificity was within 1 percentage point across sample sizes.Conclusions and RelevanceThis study of cross-sectional data found that optimal cutoff scores and accuracy estimates differed substantially from population values when data-driven methods were used to simultaneously identify an optimal cutoff score and estimate accuracy. Users of diagnostic accuracy evidence should evaluate studies of accuracy with caution and ensure that cutoff score recommendations are based on adequately powered research or well-conducted meta-analyses.

show abstract

Sample size and precision of estimates in studies of depression screening tool accuracy: A meta‐research review of studies published in 2018–2021

Cited by 4 publications

References 42 publications

Transparency and completeness of reporting of depression screening tool accuracy studies: A meta‐research review of adherence to the Standards for Reporting of Diagnostic Accuracy Studies statement

Transparency and completeness of reporting of depression screening tool accuracy studies: A meta‐research review of adherence to the Standards for Reporting of Diagnostic Accuracy Studies statement

‘Optimal’ cutoff selection in studies of depression screening tool accuracy using the PHQ‐9, EPDS, or HADS‐D: A meta‐research study

Data-Driven Cutoff Selection for the Patient Health Questionnaire-9 Depression Screening Tool

Contact Info

Product

Resources

About