2021
DOI: 10.1109/tpami.2021.3062900
|View full text |Cite
|
Sign up to set email alerts
|

Adaptation Strategies for Automated Machine Learning on Evolving Data

Abstract: Automated Machine Learning (AutoML) systems have been shown to efficiently build good models for new datasets. However, it is often not clear how well they can adapt when the data evolves over time. The main goal of this study is to understand the effect of data stream challenges such as concept drift on the performance of AutoML methods, and which adaptation strategies can be employed to make them more robust. To that end, we propose 6 concept drift adaptation strategies and evaluate their effectiveness on di… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
48
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 54 publications
(49 citation statements)
references
References 32 publications
1
48
0
Order By: Relevance
“…Taking advantage of the recent wave of research in AutoML, an alternative approach to adaptation to changing environments was proposed in Martín Salvador et al (2016) where repeated automated deployment of Auto-WEKA for Multi-Component Predictive Systems (MCPS) to learn from new batches of data was used for life-long learning and the adaptation of complex MCPS when applied to changing streaming data from process industries. Celik and Vanschoren (2021) represent a development of this idea with the inclusion of the drift detection and the experimentation using several open source AutoML frameworks. An interesting approach closely tied with the Auto_Sklearn is described in Madrid et al (2019).…”
Section: Automated Machine Learning For Streaming Datamentioning
confidence: 99%
“…Taking advantage of the recent wave of research in AutoML, an alternative approach to adaptation to changing environments was proposed in Martín Salvador et al (2016) where repeated automated deployment of Auto-WEKA for Multi-Component Predictive Systems (MCPS) to learn from new batches of data was used for life-long learning and the adaptation of complex MCPS when applied to changing streaming data from process industries. Celik and Vanschoren (2021) represent a development of this idea with the inclusion of the drift detection and the experimentation using several open source AutoML frameworks. An interesting approach closely tied with the Auto_Sklearn is described in Madrid et al (2019).…”
Section: Automated Machine Learning For Streaming Datamentioning
confidence: 99%
“…This module allows the automatic training and tuning of classification and regression models within a user-specified time-limit [34]. Specifically, it performs a fast random search in combination with automated stacked ensembles, using random forests, gradient boosting machines, deep neural nets, or generalised linear models [35].…”
Section: Related Workmentioning
confidence: 99%
“…Some recent efforts have focused on the problem of AutoML for streaming applications, where the optimal algorithm and hyperparameters might change over time (e.g., Veloso et al 2018;Carnein et al 2020;Celik and Vanschoren 2020). Although this might be interesting for EarlyTSC, in this paper we consider only static problems.…”
Section: Automated Machine Learningmentioning
confidence: 99%