2016
DOI: 10.1007/978-3-319-45547-1_16
|View full text |Cite
|
Sign up to set email alerts
|

Automated Data Pre-processing via Meta-learning

Abstract: Abstract. A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way around. As a matter of fact, a dataset usually needs to be pre-processed. Taking into account all the possible pre-processing operators, there exists a staggeringly large number of alternatives and nonexperienced users become overwhelmed. We show that this problem can be addressed by … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…Over the last few years, a plethora of AutoML systems have been developed providing partial or complete ML automation, such as Auto-sklearn [7], TPOT [8],Auto-WEKA [9], ATM [10], as well as commercial systems such as Google AutoML 1 , RapidMiner 2 , Dar-winAI 3 , and DataRobo 4 . These tools range from automatic data preprocessing [19,20], automatic feature engineering [21,22] to automatic model selection [18,23] and automatic hyper-parameters tuning [24,25]. Some approaches attempt to automatically and simultaneously choose a learning algorithm and optimize its hyper-parameters.…”
Section: Automated Machine Learningmentioning
confidence: 99%
“…Over the last few years, a plethora of AutoML systems have been developed providing partial or complete ML automation, such as Auto-sklearn [7], TPOT [8],Auto-WEKA [9], ATM [10], as well as commercial systems such as Google AutoML 1 , RapidMiner 2 , Dar-winAI 3 , and DataRobo 4 . These tools range from automatic data preprocessing [19,20], automatic feature engineering [21,22] to automatic model selection [18,23] and automatic hyper-parameters tuning [24,25]. Some approaches attempt to automatically and simultaneously choose a learning algorithm and optimize its hyper-parameters.…”
Section: Automated Machine Learningmentioning
confidence: 99%
“…Otherwise, in the worst case, all the transformations may end up having the same impact. For instance, as a first approach we considered simple regression trees [24] as meta-learners, and they suffer from this problem. Their limitation is that they contain a discrete number of leaves, and hence a discrete number of possible predictions.…”
Section: Meta-learnermentioning
confidence: 99%
“…However, in our previous works [2,4,5], we showed that meta-learning can also be used to provide support specifically in the pre-processing step. This can be done by learning the impact of data pre-processing operators on the final result of the analysis.…”
Section: Meta-learning For Data Pre-processingmentioning
confidence: 99%