Mind the gap: performance metric evaluation in brain-age prediction

Lange, Ann-Marie G. de; Anatürk, Melis; Rokicki, Jaroslav; L-K, Han; Franke, Katja; Alnæs, Dag; Ebmeier, Klaus P.; Draganski, Bogdan; Kaufmann, Tobias; Westlye, Lars T.; Hahn, Tim; Jh, Cole

doi:10.1101/2021.05.16.444349

Cited by 14 publications

(22 citation statements)

References 68 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Simultaneously, differences between diffusion approaches, and both variance explained and prediction error (RMSE, MAE) were smaller in this study. These differences are likely due to the narrower age range in our study 45 , whereas our 11 significantly larger sample emphasises the reliability of our findings.…”

Section: Consistency Across Diffusion Approachesmentioning

confidence: 65%

Brain-wide associations between white matter and age highlight the role of fornix microstructure in brain ageing

Korbmacher

Lange

Meer

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

Identifying white matter (WM) microstructure parameters that reflect the underlying biology of the brain will advance our understanding of ageing and brain health. In this extensive comparison of brain age predictions and age-associations of WM features from different diffusion approaches, we analysed UK Biobank diffusion Magnetic Resonance Imaging (dMRI) data across midlife and older age (N = 35,749, 44.6 to 82.8 years of age). Conventional and advanced dMRI approaches were consistent in predicting brain age; with their WM-features similarly related to and predicted by age. However, brain age was estimated best when combining approaches, showing different aspects of WM to contribute to brain age. Fornix was found as the central region for brain age predictions across diffusion approaches. We encourage the application of multiple dMRI approaches for detailed insights into WM, and the further investigation of fornix as a potential biomarker of brain age and ageing.

show abstract

Section: Consistency Across Diffusion Approachesmentioning

confidence: 65%

Brain-wide associations between white matter and age highlight the role of fornix microstructure in brain ageing

Korbmacher

Lange

Meer

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Thus, the fact that the TOF MRA datasets include less information than the T1‐weighted MRI datasets might partly explain the difference in accuracy between the CNN T1 and CNN TOF models. When combining both modalities, the resulting mean absolute error significantly improves and is comparable to the results described in literature, with reported MAEs mostly varying between 3 to 5 years when using T1‐weighted MRI datasets (Bashyam et al, 2020; Cole et al, 2017; Jonsson et al, 2019; Levakov et al, 2020; Mouches, Wilms, Rajashekar, Langner, & Forkert, 2021; Peng et al, 2021; Wilms et al, 2020), but often using considerably more training data, and participants with a much narrower age range, which hinders a direct comparison of the results (de Lange et al, 2021). Moreover, Bashyam et al (2020) who previously trained a deep learning brain age prediction model using more than 11,000 datasets and tested it on the SHIP database reported a MAE of 4.12 years, showing that this database is rather challenging for the brain age prediction task.…”

Section: Discussionmentioning

confidence: 99%

“…First, while using a single database increases the consistency of the results and reduces biases, leading to more robust explanations, it also results in a model that is less robust to varying scanning parameters, and limits the amount of data available. Therefore, the model prediction accuracy would benefit from training using a larger sample size, as previously demonstrated in the context of brain age prediction (de Lange et al, 2021), and data collected from different centers, especially when using deep learning models, which are known to be data hungry (Marcus, 2018). Nevertheless, based on the excellent results of the SFCN architecture on the highly diverse PAC2019 brain age prediction data reported in Peng et al (2021), we assume that the general findings of this study will hold true even for multicenter datasets, especially when proper harmonization strategies are implemented to remove possible confounding biases.…”

Section: Limitationsmentioning

confidence: 99%

Multimodal biological brain age prediction using magnetic resonance imaging and angiography with the identification of predictive regions

Mouchès

Wilms

Rajashekar

et al. 2022

Human Brain Mapping

View full text Add to dashboard Cite

Biological brain age predicted using machine learning models based on high‐resolution imaging data has been suggested as a potential biomarker for neurological and cerebrovascular diseases. In this work, we aimed to develop deep learning models to predict the biological brain age using structural magnetic resonance imaging and angiography datasets from a large database of 2074 adults (21–81 years). Since different imaging modalities can provide complementary information, combining them might allow to identify more complex aging patterns, with angiography data, for instance, showing vascular aging effects complementary to the atrophic brain tissue changes seen in T1‐weighted MRI sequences. We used saliency maps to investigate the contribution of cortical, subcortical, and arterial structures to the prediction. Our results show that combining T1‐weighted and angiography MR data led to a significantly improved brain age prediction accuracy, with a mean absolute error of 3.85 years comparing the predicted and chronological age. The most predictive brain regions included the lateral sulcus, the fourth ventricle, and the amygdala, while the brain arteries contributing the most to the prediction included the basilar artery, the middle cerebral artery M2 segments, and the left posterior cerebral artery. Our study proposes a framework for brain age prediction using multimodal imaging, which gives accurate predictions and allows identifying the most predictive regions for this task, which can serve as a surrogate for the brain regions that are most affected by aging.

show abstract

“…One also needs to choose from a large pool of ML algorithms, such as random forest regression (RFR), relevance vector regression (RVR), and Gaussian process regression (GPR), many of which have shown success in brain-age estimation. These choices are known to affect performance (Gutierrez Becker et al ., 2018, Baecker et al ., 2021 a ; de Lange et al ., 2022). However, previous studies have performed only limited comparisons on the same data and setup.…”

Section: Introductionmentioning

confidence: 99%

“…This age bias complicates or may even mislead downstream individualized decision-making. It can be mitigated using bias correction models; usually, linear regression predicting brain-age or delta using chronological age (Le et al ., 2018; Liang et al ., 2019, Smith et al ., 2019 b ; de Lange et al ., 2022). These correction models are also used to counter the systematic under- or over-estimation of age in a novel site, usually reflected in the non-zero average delta in healthy controls.…”

Section: Introductionmentioning

confidence: 99%

Brain-age prediction: a systematic comparison of machine learning workflows

Antonopoulos

Hoffstaedter

et al. 2022

Preprint

View full text Add to dashboard Cite

The difference between age predicted using anatomical brain scans and chronological age, i.e., the brain-age delta, provides a proxy for atypical aging. Various data representations and machine learning (ML) algorithms have been used for brain-age estimation. However, how these choices compare on performance criteria important for real-world applications, such as; (1) within-site accuracy, (2) cross-site generalization, (3) test-retest reliability, and (4) longitudinal consistency, remains uncharacterized. We evaluated 128 workflows consisting of 16 feature representations derived from gray matter (GM) images and eight ML algorithms with diverse inductive biases. Using four large neuroimaging databases covering the adult lifespan (total N = 2953, 18-88 years), we followed a systematic model selection procedure by sequentially applying stringent criteria. The 128 workflows showed a within-site mean absolute error (MAE) between 4.73-8.38 years, from which 32 broadly sampled workflows showed a cross-site MAE between 5.23-8.98 years. The test-retest reliability and longitudinal consistency of the top 10 workflows were comparable. The choice of feature representation and the ML algorithm both affected the performance. Specifically, voxel-wise feature spaces (smoothed and resampled), with and without principal components analysis, with non-linear and kernel-based ML algorithms performed well. Strikingly, the correlation of brain-age delta with behavioral measures disagreed between within-site and cross-site predictions. Application of the best-performing workflow on the ADNI sample showed a significantly higher brain-age delta in Alzheimer's and mild cognitive impairment patients. However, in the presence of age bias, the delta estimates in the diseased population varied depending on the sample used for bias correction. Taken together, brain-age shows promise, but further evaluation and improvements are needed for its real-world application.

show abstract

Mind the gap: performance metric evaluation in brain-age prediction

Cited by 14 publications

References 68 publications

Brain-wide associations between white matter and age highlight the role of fornix microstructure in brain ageing

Brain-wide associations between white matter and age highlight the role of fornix microstructure in brain ageing

Multimodal biological brain age prediction using magnetic resonance imaging and angiography with the identification of predictive regions

Brain-age prediction: a systematic comparison of machine learning workflows

Contact Info

Product

Resources

About