2023
DOI: 10.1111/epi.17537
|View full text |Cite
|
Sign up to set email alerts
|

Using Critical Success Index or Gilbert Skill Score as composite measures of positive predictive value and sensitivity in diagnostic accuracy studies: Weather forecasting informing epilepsy research

Abstract: The Critical Success Index (CSI) and Gilbert Skill score (GS) are verification measures that are commonly used to check the accuracy of weather forecasting.In this article, we propose that they can also be used to simplify the joint interpretation of positive predictive value (PPV) and sensitivity estimates across diagnostic accuracy studies of epilepsy data. This is because CSI and GS each provide a single measure that takes the weather forecasting equivalent of PPV and sensitivity into account. We have re-an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 10 publications
0
7
0
Order By: Relevance
“…For example, high EI values could result from very high numbers of TN alone even if numbers of TP were modest as long as numbers of FP and FN were few, a situation which may be encountered for example when handling administrative health datasets [22] and polygenic hazard scores [23]. Addressing the class imbalance problem using methods which oversample the minority class (in the current example TP cases), such as variants of the synthetic majority oversampling technique, SMOTE [21], or which undersample the majority class (in the current example TN) might be applicable.…”
Section: Limitationsmentioning
confidence: 99%
“…For example, high EI values could result from very high numbers of TN alone even if numbers of TP were modest as long as numbers of FP and FN were few, a situation which may be encountered for example when handling administrative health datasets [22] and polygenic hazard scores [23]. Addressing the class imbalance problem using methods which oversample the minority class (in the current example TP cases), such as variants of the synthetic majority oversampling technique, SMOTE [21], or which undersample the majority class (in the current example TN) might be applicable.…”
Section: Limitationsmentioning
confidence: 99%
“…This circumstance makes it difficult to rank the diagnostic accuracy of the corresponding case-ascertainment algorithms based on Spec, NPV, or Acc, as the figures are all similarly high [ 32 ]. In conditions such as dementia [ 33 ], motor neuron disease [ 34 ], and epilepsy [ 35 , 36 ], systematic reviews of the diagnostic accuracy of routine data indicate that the original studies published have largely measured the positive predictive value (PPV) and sensitivity (Sens) without measuring Spec or NPV.…”
Section: Introductionmentioning
confidence: 99%
“…We have demonstrated the advantages of using CSI to complement traditional diagnostic accuracy measures using real-word data in several conditions [ 32 , 40 , 57 ].…”
Section: Introductionmentioning
confidence: 99%
“…In terms of the base data of the 2x2 contingency table: CSI may also be expressed in terms of PPV and Sens: We have demonstrated the advantages of using CSI to complement traditional diagnostic accuracy measures using real-word data in several conditions. 3,9,17 A question often raised about CSI concerns how its values relate to prevalence, P, the probability of a positive diagnosis. It is well-known that values of PPV vary with P, hence are sensitive to class imbalance and may therefore not be generalizable between studies.…”
Section: Introductionmentioning
confidence: 99%
“…This circumstance makes it difficult to rank the diagnostic accuracy of the corresponding case-ascertainment algorithms based on Spec, NPV, or Acc, as the figures are all similarly high. 3 In conditions such as dementia, 4 motor neurone disease, 5 and epilepsy, 2 systematic reviews of the diagnostic accuracy of routine data indicate that the original studies published have largely measured positive predictive value (PPV) and Sens without measuring Spec or NPV. This is because finding true negative cases in the community to verify an absent diagnostic code in a routine dataset is a challenge for researchers, who often only have permission to study populations that have been positively coded with the disease in question.…”
Section: Introductionmentioning
confidence: 99%