2021
DOI: 10.31235/osf.io/gu3ap
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The InterModel Vigorish (IMV) as a flexible and portable approach for quantifying predictive accuracy with binary outcomes

Abstract: Understanding the ``fit'' of models meant to predict binary outcomes has been a long-standing problem. We propose a novel metric---the InterModel Vigorish (IMV)---for quantifying the value of change in predictive accuracy between two systems in the case of a binary outcome. The metric is based on an analogy to well-characterized physical systems with tractable probabilities. We first translate a baseline prediction of some binary outcomes into a statement about a canonical system---weighted coins---by equating… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3

Relationship

3
3

Authors

Journals

citations
Cited by 8 publications
(12 citation statements)
references
References 0 publications
2
10
0
Order By: Relevance
“…This is consistent with the notion that individuals of a certain type are more likely to get sampled in the UKB. This pattern of smaller standard deviations is also in line with our simulations as can be visually seen in Our LASSO probit model adequately discriminates between UKB and UK Census observations, with an area under the curve (AUC) of 0.767 [16] (IM V = 0.006 [17]) in the independent holdout sample. For comparison, our AUC is similar to the AUC achieved when predicting mortality in the Health and Retirement Study to correct for mortality selection bias [18].…”
Section: Evidence For Volunteer Bias In Ukbsupporting
confidence: 88%
See 1 more Smart Citation
“…This is consistent with the notion that individuals of a certain type are more likely to get sampled in the UKB. This pattern of smaller standard deviations is also in line with our simulations as can be visually seen in Our LASSO probit model adequately discriminates between UKB and UK Census observations, with an area under the curve (AUC) of 0.767 [16] (IM V = 0.006 [17]) in the independent holdout sample. For comparison, our AUC is similar to the AUC achieved when predicting mortality in the Health and Retirement Study to correct for mortality selection bias [18].…”
Section: Evidence For Volunteer Bias In Ukbsupporting
confidence: 88%
“…Our LASSO probit model adequately discriminates between UKB and UK Census observations, with an area under the curve (AUC) of 0.767 [16] ( IMV = 0.006 [17]) in the independent holdout sample. For comparison, our AUC is similar to the AUC achieved when predicting mortality in the Health and Retirement Study to correct for mortality selection bias [18].…”
Section: Resultsmentioning
confidence: 99%
“…To account for the data imbalances, we use Welch’s t test and two different but complementary methods within the supervised learning framework. The first is the InterModel Vigorish (IMV), a new technique to analyze the outcomes of predictive models (specifically logistic regression) that works by comparing the predictive accuracy of a given model with another model that only predicts the mode outcome [ 50 ]. The results from this method could be compared to the common vigorish (percentage of winnings deducted from players) of 1% in blackjack which, on average, becomes 0.0099 cents per dollar for the casino.…”
Section: Methodsmentioning
confidence: 99%
“…For the 2015-2016, the IMV between a baseline prevalence model and one that uses essays was approximately 0.03. This is comparable to the IMV for predicting death within the next two years for those approximately 90 years old given additional information about their age, sex, and education level [44]. In the models for the 2016-2017 essays and entire corpus, the IMV drops to approximately one third of the IMV score for the 2015-2016 essays.…”
Section: Study One: Variation In Essay Content and Stylementioning
confidence: 63%
“…To account for the data imbalances, we use Welch's t-test and two different but complementary methods within the supervised learning framework. The first is the InterModel Vigorish (IMV), a new technique to analyze the outcomes of predictive models (specifically logistic regression) that works by comparing the predictive accuracy of a given model with another model that only predicts the mode outcome [44]. The results from this method could be compared to the common vigorish (percentage of winnings deducted from players) of 1% in blackjack which, on average, becomes 0.0099 cents per dollar for the casino.…”
Section: Study One: Essay Content and Stylementioning
confidence: 99%