2024
DOI: 10.21203/rs.3.rs-4172933/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A practical evaluation of AutoML tools for binary, multiclass, and multilabel classification

Marcelo V. C. Aragão,
Augusto G. Afonso,
Rafaela C. Ferraz
et al.

Abstract: Choosing the right Automated Machine Learning (AutoML) tool is crucial for researchers of varying expertise to achieve optimal performance in diverse classification tasks. However, the abundance of AutoML frameworks with varying features makes selection challenging. This study addresses this gap by conducting a practical evaluation informed by a theoretical and bibliographical review and a feature-based comparison of twelve AutoML frameworks. The evaluation, conducted under time constraints, assessed accuracy … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(10 citation statements)
references
References 65 publications
0
10
0
Order By: Relevance
“…The datasets used in the experiments were the same employed in Aragão et al (2023). It was done once the presented datasets contained the necessary properties to apply to the current study and enable future comparisons between both studies.…”
Section: Datasetsmentioning
confidence: 99%
See 4 more Smart Citations
“…The datasets used in the experiments were the same employed in Aragão et al (2023). It was done once the presented datasets contained the necessary properties to apply to the current study and enable future comparisons between both studies.…”
Section: Datasetsmentioning
confidence: 99%
“…The hill-valley (1479) dataset is perfectly balanced once both classes have the same number of samples. This dataset was kept in the study since this feature must be lost in the data split step (described ahead) and to be compared with (Aragão et al, 2023). On the other hand, diabetes (37) and wdbc (1510) are the most imbalanced binary datasets.…”
Section: Datasetsmentioning
confidence: 99%
See 3 more Smart Citations