2021
DOI: 10.3390/jpm11100978
|View full text |Cite
|
Sign up to set email alerts
|

Hyperparameter Tuning and Pipeline Optimization via Grid Search Method and Tree-Based AutoML in Breast Cancer Prediction

Abstract: Automated machine learning (AutoML) has been recognized as a powerful tool to build a system that automates the design and optimizes the model selection machine learning (ML) pipelines. In this study, we present a tree-based pipeline optimization tool (TPOT) as a method for determining ML models with significant performance and less complex breast cancer diagnostic pipelines. Some features of pre-processors and ML models are defined as expression trees and optimal gene programming (GP) pipelines, a stochastic … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9

Relationship

1
8

Authors

Journals

citations
Cited by 43 publications
(15 citation statements)
references
References 21 publications
0
15
0
Order By: Relevance
“…Automated machine learning (AutoML) is a tool that automates the process of developing a machine learning model that has lately gained popularity. AutoML libraries come in a variety of environment including Auto-WEKA [ 29 ], Tree-based Pipeline Optimisation Tool (TPOT) [ 30 , 31 ], Auto-Sklearn, and others [ 32 , 33 , 34 ]. In this context, AutoML runs through a dataset and suggests the most optimum algorithm with the parameters set.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Automated machine learning (AutoML) is a tool that automates the process of developing a machine learning model that has lately gained popularity. AutoML libraries come in a variety of environment including Auto-WEKA [ 29 ], Tree-based Pipeline Optimisation Tool (TPOT) [ 30 , 31 ], Auto-Sklearn, and others [ 32 , 33 , 34 ]. In this context, AutoML runs through a dataset and suggests the most optimum algorithm with the parameters set.…”
Section: Introductionmentioning
confidence: 99%
“…AutoML is the process of automating the application of machine learning to real-world situations from start to the end. AutoML can be explained mathematically as follows [ 31 ]: …”
Section: Introductionmentioning
confidence: 99%
“…We developed an easy-to-use model in the clinic by using only CYP2C19 genotypes and some noninvasive clinical parameters as predictors, and observed the influence of the omitted predictors on the performance of the proposed XGBoost model. Finally, we optimized the hyperparameters via the sklearn’s own grid search approach using the evaluation metric of MAE and tenfold cross-validation ( Radzi et al, 2021 ), and verified this simplified model after optimization in our independent external dataset, which consisted of 105 input-output data pairs retrospectively collected from our routine TDM practice according to guidelines of the Ethics Committee of the Affiliated Brain Hospital of Guangzhou Medical University approval ([2021] NO.027). The inputs to the external dataset were the same as those of the finally generated combined dataset with Single Dose, ALB, , and omitted.…”
Section: Methodsmentioning
confidence: 99%
“…Third, the hyperparameters for each base model were adjusted to yield better predictive performance via the grid search method in scikit-learn. It evaluated the given hyperparameter combinations of the model by using “neg_mean_absolute_error” as the evaluation metric and 10-fold cross-validation ( Radzi et al, 2021 ). This technique allows inexperienced data scientists to acquire recommendations for tuning the parameters, but may be time consuming and inefficient in case of a large number of parameters.…”
Section: Methodsmentioning
confidence: 99%