The Cost of Untracked Diversity in Brain-Imaging Prediction

Benkarim, Oualid; Paquola, Casey; Park, B; Kebets; Hong, Seok‐Jun; Vos de Wael, R; Zhang, Shaoshi; Yeo, B.T. Thomas; Eickenberg, Michael; Ge, Tida; Poline, JB; Bernhardt, Boris C.; Bzdok, Danilo

doi:10.1101/2021.06.16.448764

Cited by 11 publications

(16 citation statements)

References 146 publications

(170 reference statements)

“…The neuroimaging community is beginning to recognize and explore the impacts of ethics in machine learning models, with a particular focus on bias in datasets and models (Benkarim et al 2021). Trust is distinct from bias, and it is an equally important yet widely overlooked facet of ethics in neuroimaging models.…”

Section: Ethics In Neuroimaging: the Role Of Bias And Trust

mentioning

confidence: 99%

“…Connectome-based predictive models are at the forefront of this trend (Finn and Rosenberg 2021;Shen et al 2017), showing promising results in understanding general cognition (Beaty et al 2018;Song et al 2021;Dubois et al 2018;Rosenberg et al 2018) and mental health (Du et al 2018;Lynall et al 2010;Nielsen et al 2020). Improvements in accuracy (Cui and Gong 2018;Gan et al 2021;Li et al 2021;Kohoutová et al 2020) and fairness (i.e., lack of bias (Benkarim et al 2021)) of connectome-based models represent an important step in preparing these models for real-world applications. But, accurate and bias-free models are not enough.…”

Section: Introduction

mentioning

confidence: 99%

Connectome-based machine learning models are vulnerable to subtle data manipulations

Rosenblatt, Rodriguez, Westwater et al. 2021. Preprint.

Functional connectome-based predictive models continue to grow in popularity and predictive performance. As these models become more widely used, researchers have begun to question the idea of bias in the models, which is a crucial component of ethics in artificial intelligence. However, we show that model trustworthiness is a more important but vastly overlooked component of the ethics of functional connectome-based predictive models. In this work, we define “trust” as robustness to adversarial attacks, or data alterations designed to trick a model. We show that typical implementations of connectome-based models are untrustworthy and can easily be manipulated through adversarial attacks. We use classification of self-reported biological sex in three datasets (Adolescent Brain Cognitive Development Study, Human Connectome Project, and Philadelphia Neurodevelopmental Cohort) and for three types of predictive models (support vector machine (SVM), logistic regression, kernel SVM) as a benchmark to show that many forms of adversarial attacks are effective against connectome-based models. The attacks include changing the prediction by altering the data at test time, real-world changes at the time of scanning, and improving performance by injecting a pattern into the data. Despite drastic changes in prediction performance after adversarial attacks, the corrupted connectomes appear nearly identical to the original ones and perform similarly in downstream analyses. These findings demonstrate a need to evaluate the trustworthiness and ethics of connectome-based models before we can apply them broadly, as well as a need to develop methods that are robust to a wide range of adversarial attacks.

“…Together, this reflects a paradigm shift in human neuroscience research from a focus on the group to a focus on the individual, with important potential applications to clinical practice [21][22][23]. To deliver on this promise, however, these approaches must identify patterns of brain activity that are relevant to the phenotype of interest in a given individual (the patient sitting before their clinician, for example). Previous linear modelling work has relied on the assumptions that (1) a single brain network is associated with a given phenotype, with patterns of activity within that network varying across individuals [10,24,25]; and (2) larger, more heterogeneous samples will more accurately and reliably capture this single model [26,27]. But although many published models have demonstrated impressive generalizability [6,9,10], they do not account for brain-phenotype relationships in all individuals [13,14].…”

mentioning

confidence: 99%

“…But although many published models have demonstrated impressive generalizability [6,9,10], they do not account for brain-phenotype relationships in all individuals [13,14]. This raises the crucial question of in whom models fail, and why. The existence of structured model failure (some individuals who are better fit by a model than others [14,24,26]) would suggest that one brain-phenotype relationship does not fit all, and that systematic bias may determine who is fit and who is not. This, in turn, may engender imprecise, misleading and in some cases harmful model interpretations.…”

mentioning

confidence: 99%

“…This, in turn, may engender imprecise, misleading and in some cases harmful model interpretations. That is, a brain network that is found to be associated with a given phenotype may only apply to a specific subset of the population at large, limiting its practical utility [14,26,28], or may not represent the phenotype of interest. Indeed, factors that interfere with adequate phenotypic characterization have been documented for many widely used neurocognitive tests [18,29], and may include the fallacy of universalism (construct bias), the application of inappropriate norms, discordance between primary and assessment language and the presence of…”

mentioning

confidence: 99%

Brain–phenotype models fail for individuals who defy sample stereotypes

Greene, Shen, Noble et al. 2022. Nature.

109

Individual differences in brain functional organization track a range of traits, symptoms and behaviours [1–12]. So far, work modelling linear brain–phenotype relationships has assumed that a single such relationship generalizes across all individuals, but models do not work equally well in all participants [13,14]. A better understanding of in whom models fail and why is crucial to revealing robust, useful and unbiased brain–phenotype relationships. To this end, here we related brain activity to phenotype using predictive models (trained and tested on independent data to ensure generalizability [15]) and examined model failure. We applied this data-driven approach to a range of neurocognitive measures in a new, clinically and demographically heterogeneous dataset, with the results replicated in two independent, publicly available datasets [16,17]. Across all three datasets, we find that models reflect not unitary cognitive constructs, but rather neurocognitive scores intertwined with sociodemographic and clinical covariates; that is, models reflect stereotypical profiles, and fail when applied to individuals who defy them. Model failure is reliable, phenotype specific and generalizable across datasets. Together, these results highlight the pitfalls of a one-size-fits-all modelling approach and the effect of biased phenotypic measures [18–20] on the interpretation and utility of resulting brain–phenotype models. We present a framework to address these issues so that such models may reveal the neural circuits that underlie specific phenotypes and ultimately identify individualized neural targets for clinical intervention.
