2017
DOI: 10.1142/s021821301760020x
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Creation of Machine Learning Workflows with Strongly Typed Genetic Programming

Abstract: Manual creation of machine learning ensembles is a hard and tedious task which requires an expert and a lot of time. In this work we describe a new version of the GP-ML algorithm which uses genetic programming to create machine learning workows (combinations of preprocessing, classification, and ensembles) automatically, using strongly typed genetic programming and asynchronous evolution. The current version improves the way in which the individuals in the genetic programming are created and allows for much la… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
10
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(10 citation statements)
references
References 5 publications
0
10
0
Order By: Relevance
“…Currently, Auto-ML methods [3,5,6,8,15] have been dealing with the optimization of complete ML pipelines. This means that, instead of just focusing on ML algorithms and their hyper-parameters, these methods are also concerned with other aspects of ML, such as data preprocessing (e.g., feature normalization or feature selection) and post-processing (e.g., classification probability calibration).…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Currently, Auto-ML methods [3,5,6,8,15] have been dealing with the optimization of complete ML pipelines. This means that, instead of just focusing on ML algorithms and their hyper-parameters, these methods are also concerned with other aspects of ML, such as data preprocessing (e.g., feature normalization or feature selection) and post-processing (e.g., classification probability calibration).…”
Section: Related Workmentioning
confidence: 99%
“…The Tree-Based Pipeline Optimization Tool (TPOT) [8], for instance, applies a canonical genetic programming (GP) algorithm to search for the most appropriate ML pipeline in the Scikit-Learn library. Considering a different evolutionary approach, the Genetic Programming for Machine Learning method (GP-ML) [6] uses a strongly typed genetic programming (STGP) method to restrict the Scikit-Learn pipelines in such a way that they are always meaningful from the machine learning point of view. Finally, the REsilient ClassifIcation Pipeline Evolution method (RECIPE) [3] adopts a grammar-based genetic programming (GGP) method to search for Scikit-Learn pipelines.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…There are also other recently proposed EAs for related AutoML tasks. In particular, the EAs proposed in [18][19][20] try to optimize an entire machine learning pipeline for a given dataset, including the choice of data preprocessing methods (like feature scaling operators and feature selection methods) and classification algorithm. By contrast, we focus on using EAs that recommend only classification algorithms.…”
Section: Introductionmentioning
confidence: 99%