2022
DOI: 10.1162/netn_a_00212

Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics

Abstract: In this critical review, we examine the application of predictive models, e.g., classifiers, trained using machine learning (ML) to assist in the interpretation of functional neuroimaging data. Our primary goal is to summarize how ML is being applied and to critically assess common practices. Our review covers 250 studies published using ML and resting-state functional MRI (fMRI) to infer various dimensions of the human functional connectome. Results for hold-out (“lockbox”) performance were, on average, ~13% less accu…
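For readers unfamiliar with the “lockbox” design the abstract refers to, the sketch below illustrates the general idea: accuracy is first estimated by internal cross-validation on a development set, and a held-out lockbox set is scored exactly once at the very end. The synthetic data, model, and split sizes are illustrative assumptions made for this note, not the review's own analysis pipeline.

```python
# Illustrative sketch (assumed synthetic data and model, not the review's
# own pipeline): estimate accuracy by internal cross-validation on a
# development set, then score a held-out "lockbox" set exactly once.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.svm import SVC

# Synthetic stand-in for connectome features (subjects x edges) and labels.
X, y = make_classification(n_samples=200, n_features=500, n_informative=20,
                           random_state=0)

# Set the lockbox aside before any modeling decisions are made.
X_dev, X_lockbox, y_dev, y_lockbox = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)

model = SVC(kernel="linear", C=1.0)

# Internal cross-validation estimate, computed only on the development set.
cv_accuracy = cross_val_score(model, X_dev, y_dev, cv=5).mean()

# Single, final evaluation on the lockbox set.
model.fit(X_dev, y_dev)
lockbox_accuracy = model.score(X_lockbox, y_lockbox)

print(f"cross-validated accuracy: {cv_accuracy:.3f}")
print(f"lockbox accuracy:         {lockbox_accuracy:.3f}")
```

In this design, the gap between the cross-validated estimate and the lockbox score gives a sense of how optimistic the internal estimate was, which is the comparison the review summarizes across studies.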

Cited by 17 publications (19 citation statements)
References 81 publications
“…Importantly, according to the criteria of Cohen (1988), all observed effect sizes (r ~ .2) can be considered small, and only a few of the calculated measures reached statistical significance when correcting for multiple comparisons. These results contribute to the current debate about the effect size to be expected in investigations of brain-behavior relations (Marek et al., 2022; Rosenberg and Finn, 2022) by demonstrating that the combination of cross-validation (Sui et al., 2020; Cwiek et al., 2022) and multimodal analysis approaches can identify robust brain-behavior relations despite sample sizes that lie clearly below one thousand. We compared three forms of analysis (explanation of intelligence scores, internal cross-validation, and out-of-sample prediction: prediction of intelligence scores in a replication sample with the model constructed in the main sample) to show, in line with Cwiek et al. (2022), that cross-validation markedly reduces the overall effect size compared with explanation (r = .31 to r = .22 and r = .23).…”
Section: Discussion
confidence: 61%
“…These results contribute to the current debate about the effect size to be expected in investigations of brain-behavior relations (Marek et al., 2022; Rosenberg and Finn, 2022) by demonstrating that the combination of cross-validation (Sui et al., 2020; Cwiek et al., 2022) and multimodal analysis approaches can identify robust brain-behavior relations despite sample sizes that lie clearly below one thousand. We compared three forms of analysis (explanation of intelligence scores, internal cross-validation, and out-of-sample prediction: prediction of intelligence scores in a replication sample with the model constructed in the main sample) to show, in line with Cwiek et al. (2022), that cross-validation markedly reduces the overall effect size compared with explanation (r = .31 to r = .22 and r = .23). That the internally cross-validated effect size reflects a more realistic estimate of the ‘true’ effect size (Yarkoni and Westfall, 2017) is supported by our out-of-sample prediction in the independent sample (r = .23).…”
Section: Discussion
confidence: 61%
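As a rough illustration of the three analysis forms contrasted in the statements above (in-sample explanation, internal cross-validation, and out-of-sample prediction in a replication sample), here is a minimal sketch on synthetic data. The features, behavioral score, model, and sample sizes are assumptions made for illustration and do not reproduce the cited studies; the typical outcome is simply that the in-sample correlation exceeds the cross-validated and replication-sample correlations.

```python
# Illustrative sketch (assumed synthetic data and model): compare the
# in-sample "explanation" correlation with internally cross-validated and
# out-of-sample prediction correlations.
import numpy as np
from scipy.stats import pearsonr
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict

rng = np.random.default_rng(0)
n_main, n_rep, n_feat = 300, 150, 100

# Synthetic brain features and a behavioral score (stand-in for an
# intelligence score); only a small subset of features carries signal.
beta = rng.normal(size=n_feat) * (rng.random(n_feat) < 0.1)
X_main = rng.normal(size=(n_main, n_feat))
y_main = X_main @ beta + rng.normal(scale=5.0, size=n_main)
X_rep = rng.normal(size=(n_rep, n_feat))
y_rep = X_rep @ beta + rng.normal(scale=5.0, size=n_rep)

model = Ridge(alpha=10.0)

# 1) Explanation: fit and evaluate on the same (main) sample.
model.fit(X_main, y_main)
r_explain = pearsonr(y_main, model.predict(X_main))[0]

# 2) Internal cross-validation within the main sample.
y_cv = cross_val_predict(Ridge(alpha=10.0), X_main, y_main, cv=10)
r_cv = pearsonr(y_main, y_cv)[0]

# 3) Out-of-sample prediction in the replication sample, using the model
#    constructed in the main sample.
r_oos = pearsonr(y_rep, model.predict(X_rep))[0]

print(f"explanation r = {r_explain:.2f}, cross-validated r = {r_cv:.2f}, "
      f"out-of-sample r = {r_oos:.2f}")
```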
“…Adding flexibility, predictive algorithms built on top of these large datasets typically involve a great number of investigator decisions, the combined effects of which undermine the reliability of findings [for an example in connectivity modeling, see Hallquist and Hillary, 2018]. Results of machine learning models, for example, are sensitive to model specification and parameter tuning [Pineau et al., 2021; Bouthillier et al., 2019; Cwiek et al., 2021]. Computational approaches permit systematically combing through a great number of potential variables of interest and their statistical relationships (specifically, at scales that would be infeasible to explore manually).…”
Section: Big Data and Computational Methods as Friend and Foe
confidence: 99%
“…The strength of a hypothesis refers to how specific and how refutable it is (Popper, 1963; see Table 1 for examples). We also argue for greater emphasis on testing and refuting strong hypotheses through a “team science” framework that allows us to address the heterogeneity in samples and/or methods that makes so many published findings tentative (Cwiek et al., 2021; Bryan et al., 2021).…”
Section: Background and Motivation
confidence: 99%
“…Adding flexibility, predictive algorithms built on top of these large datasets typically involve a great number of investigator decisions, the combined effects of which undermine the reliability of findings [for an example in connectivity modeling, see Hallquist and Hillary, 2019]. Results of machine learning models, for example, are sensitive to model specification and parameter tuning (Pineau, 2021; Bouthillier et al., 2019; Cwiek et al., 2021). Computational approaches permit systematically combing through a great number of potential variables of interest and their statistical relationships (specifically, at scales that would be infeasible to explore manually).…”
Section: Background and Motivation
confidence: 99%
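A minimal sketch of the sensitivity these statements describe, assuming a generic scikit-learn workflow on synthetic data: the same data and the same cross-validation scheme are evaluated under several equally plausible model specifications and hyperparameter settings, and the resulting scores can differ noticeably. The model names and settings below are illustrative assumptions, not choices taken from the cited papers.

```python
# Illustrative sketch (assumed synthetic data): the same cross-validation
# scheme applied to several defensible model specifications can yield
# noticeably different performance estimates.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = make_classification(n_samples=150, n_features=300, n_informative=15,
                           random_state=0)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

# A handful of reasonable-looking analyst choices.
specifications = {
    "logreg, C=1":     make_pipeline(StandardScaler(),
                                     LogisticRegression(C=1.0, max_iter=5000)),
    "logreg, C=0.01":  make_pipeline(StandardScaler(),
                                     LogisticRegression(C=0.01, max_iter=5000)),
    "linear SVM, C=1": make_pipeline(StandardScaler(),
                                     SVC(kernel="linear", C=1.0)),
    "RBF SVM, C=10":   make_pipeline(StandardScaler(),
                                     SVC(kernel="rbf", C=10.0)),
}

for name, pipe in specifications.items():
    accuracy = cross_val_score(pipe, X, y, cv=cv).mean()
    print(f"{name:16s} mean CV accuracy = {accuracy:.3f}")
```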