2020
DOI: 10.1007/978-3-030-43823-4_10
|View full text |Cite
|
Sign up to set email alerts
|

The autofeat Python Library for Automated Feature Engineering and Selection

Abstract: This paper describes the autofeat Python library, which provides a scikit-learn style linear regression model with automated feature engineering and selection capabilities. Complex non-linear machine learning models such as neural networks are in practice often difficult to train and even harder to explain to non-statisticians, who require transparent analysis results as a basis for important business decisions. While linear models are efficient and intuitive, they generally provide lower prediction accuracies… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 49 publications
(30 citation statements)
references
References 28 publications
0
23
0
Order By: Relevance
“…The R 2 for linear regression, however, increased considerably from 0.80 to 0.96. This is rather expected considering that even on benchmark datasets, the model showed remarkable improvements on linear regression tasks but could not outperform the state-ofthe-art RF regression models 36 .…”
Section: Methodsmentioning
confidence: 91%
See 1 more Smart Citation
“…The R 2 for linear regression, however, increased considerably from 0.80 to 0.96. This is rather expected considering that even on benchmark datasets, the model showed remarkable improvements on linear regression tasks but could not outperform the state-ofthe-art RF regression models 36 .…”
Section: Methodsmentioning
confidence: 91%
“…In fact, an earlier study on protein-ligand binding affinity also exhibited that predictions based on RF and decision trees consistently outperformed linear regression models 35 . In order to further augment the predictive power of our models, we implemented an automated feature engineering and selection using the autofeat library of python 36 . It is a framework inspired by the SISSO algorithm 37 that automatically generates a large number of non-linear features from the input descriptors and then selects the most informative of them as additional features.…”
Section: Methodsmentioning
confidence: 99%
“…However, in the presence of manually selected features it could be feared that important information on the gait was in advance omitted (Dindorf et al, 2020). Therefore, automated feature selection methods are an alternative approach, providing discrete as well as time series parameters (Christ et al, 2018;Horn et al, 2020). However, these methods might yield abstract features that are difficult to interpret and therefore do not allow a straight forward deduction of a treatment approach.…”
Section: Tablementioning
confidence: 99%
“…," a python framework for automatic features generation, is used that automatically generates non-linear features from input and then selects important ones from generated features [35]. This library uses combination of two or more features to generate new ones.…”
Section: B Feature Engineeringmentioning
confidence: 99%