2019
DOI: 10.1002/widm.1301

Hyperparameters and tuning strategies for random forest

Abstract: The random forest (RF) algorithm has several hyperparameters that have to be set by the user, for example, the number of observations drawn randomly for each tree and whether they are drawn with or without replacement, the number of variables drawn randomly for each split, the splitting rule, the minimum number of samples that a node must contain, and the number of trees. In this paper, we first provide a literature review on the parameters' influence on the prediction performance and on variable importance measures. …

Cited by 1,071 publications (761 citation statements)
References 51 publications
“…We validated the modular decomposition using permutation testing on three levels: (i) the individual unthresholded FC-matrix and (ii) the co-classification matrix of each participant, as well as (iii) the co-classification matrix at the group level (Dwyer et al. 2014). To avoid leakage of information from the test to the training data, we evaluated the performance of the classifier using (i) a nested cross-validation (leaving out the two observations corresponding to one subject for testing) and (ii) an inner validation approach for the hyperparameter optimization of the random forest classifier (using a sequential model-based optimization implemented by the Scikit-Optimize library; skopt, https://github.com/scikit-optimize/scikit-optimize), iteratively tuning the following parameters in line with the recommendations of Probst et al. (Probst, Wright and Boulesteix 2019): maximum depth of the tree, number of features, minimum number of samples required to split a node, and minimum number of samples required to be at a leaf node. We statistically validated the observed accuracy using permutation testing (p < 0.05, 5,000 iterations), randomizing the class labels.…”
Section: Modularity Analysis
confidence: 99%
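The workflow in the excerpt above (sequential model-based optimization of a random forest classifier inside a nested cross-validation) can be sketched roughly in Python with scikit-optimize. The synthetic data, parameter ranges, and plain 5-fold splits below are illustrative assumptions, not details taken from the cited study, which used leave-one-subject-out splits.

```python
# Sketch of model-based hyperparameter optimization for a random forest
# classifier with scikit-optimize (skopt), wrapped in a nested
# cross-validation. Data and parameter ranges are illustrative assumptions.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from skopt import BayesSearchCV
from skopt.space import Integer

X, y = make_classification(n_samples=200, n_features=50, random_state=0)

# Inner loop: sequential model-based optimization over the four parameters
# named in the excerpt (tree depth, number of features per split,
# minimum samples to split a node, minimum samples per leaf).
inner_search = BayesSearchCV(
    RandomForestClassifier(n_estimators=300, random_state=0),
    {
        "max_depth": Integer(2, 30),
        "max_features": Integer(1, 50),
        "min_samples_split": Integer(2, 20),
        "min_samples_leaf": Integer(1, 10),
    },
    n_iter=20,
    cv=5,
    scoring="accuracy",
    random_state=0,
)

# Outer loop: the test folds never take part in the tuning step,
# which is what prevents leakage from test to training data.
outer_scores = cross_val_score(inner_search, X, y, cv=5)
print("Nested-CV accuracy:", outer_scores.mean())
```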
“…We used the open-source R software environment for statistical computing and graphics (version 3.5.0) within an integrated development environment for R, RStudio (RStudio Desktop version 1.1.447), to analyse the data assembled in step V. For regression tasks we used the ranger package (version 0.10.1) as an implementation of random forests (Wright and Ziegler 2017). To obtain the most accurate predictions, the random forest parameters need to be optimised (Probst et al. 2018). To configure the parameters of the random forest, we used the tuneRanger package (version 0.3) (Probst et al. 2018), which provides model-based optimization as the tuning strategy and tunes the three parameters min.node.size, sample.fraction and mtry at once.…”
Section: Spatial Data Analysis
confidence: 99%
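As a rough illustration of the tuneRanger workflow described above: the R package tunes mtry, min.node.size and sample.fraction jointly via model-based optimization. The sketch below approximates this in Python with the closest scikit-learn counterparts (max_features, min_samples_leaf, max_samples) and scikit-optimize; it is an analogue, not the cited R implementation, and the ranges and data are illustrative assumptions.

```python
# Rough Python analogue of tuning min.node.size, sample.fraction and mtry
# at once via model-based optimization, using the closest scikit-learn
# counterparts. Ranges and data are illustrative assumptions.
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from skopt import BayesSearchCV
from skopt.space import Integer, Real

X, y = make_regression(n_samples=300, n_features=20, noise=10.0, random_state=0)

search = BayesSearchCV(
    RandomForestRegressor(n_estimators=500, random_state=0),
    {
        "max_features": Integer(1, 20),      # roughly mtry
        "min_samples_leaf": Integer(1, 20),  # roughly min.node.size
        "max_samples": Real(0.2, 0.95),      # roughly sample.fraction
    },
    n_iter=20,
    cv=5,
    scoring="neg_mean_squared_error",
    random_state=0,
)
search.fit(X, y)
print(search.best_params_)
```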
“…To obtain the most accurate predictions, the random forest parameters need to be optimised (Probst et al. 2018). To configure the parameters of the random forest, we used the tuneRanger package (version 0.3) (Probst et al. 2018), which provides model-based optimization as the tuning strategy and tunes the three parameters min.node.size, sample.fraction and mtry at once. Out-of-bag predictions were used for evaluation.…”
Section: Spatial Data Analysis
confidence: 99%
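The excerpt above adds that out-of-bag (OOB) predictions were used for evaluation. The sketch below shows the same idea in Python with scikit-learn, as an analogue of the R ranger setup used in the cited work; the regression data are synthetic and serve only as an illustration.

```python
# Out-of-bag (OOB) evaluation sketch: each observation is predicted only by
# the trees whose bootstrap sample did not contain it, so no separate
# hold-out set is needed. Synthetic data for illustration only.
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error

X, y = make_regression(n_samples=300, n_features=20, noise=10.0, random_state=0)

rf = RandomForestRegressor(n_estimators=1000, oob_score=True, random_state=0)
rf.fit(X, y)

print("OOB R^2:", rf.oob_score_)
print("OOB MSE:", mean_squared_error(y, rf.oob_prediction_))
```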
“…Compared with logistic regression, random forests require tuning of some of their parameters. We used the function OOBCurve in the homonymous R package (Probst and Boulesteix) to tune the number of trees, whereas we used the latest implementation in the tuneRanger R package (Probst et al.) to tune the number of variables randomly sampled as candidates at each split, the minimal size of terminal nodes, and the fraction of observations to sample (function tuneRanger). In all cases, we chose the area under the curve (AUC) as the performance criterion for tuning.…”
Section: Assessing the Effect of Company and Network Data on Credit Risk
confidence: 99%
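The OOBCurve step described above traces out-of-bag performance as a function of the number of trees. The sketch below approximates that idea in Python with scikit-learn, growing a forest with warm_start and recording OOB AUC at a few forest sizes; it is a rough analogue of the R package, not its implementation, and the data and tree grid are assumptions.

```python
# OOB performance as a function of the number of trees, in the spirit of
# the OOBCurve idea: grow the forest incrementally with warm_start and
# record OOB AUC at each size. Data and tree grid are illustrative.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score

X, y = make_classification(n_samples=500, n_features=25, random_state=0)

rf = RandomForestClassifier(oob_score=True, warm_start=True, random_state=0)
for n_trees in (50, 100, 250, 500, 1000):
    rf.set_params(n_estimators=n_trees)
    rf.fit(X, y)  # warm_start adds trees instead of refitting from scratch
    # oob_decision_function_ holds class probabilities from OOB votes only.
    oob_auc = roc_auc_score(y, rf.oob_decision_function_[:, 1])
    print(n_trees, round(oob_auc, 4))
```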