2021
DOI: 10.1007/s00500-021-05590-y
|View full text |Cite
|
Sign up to set email alerts
|

A new imputation method based on genetic programming and weighted KNN for symbolic regression with incomplete data

Abstract: Incompleteness is one of the problematic data quality challenges in real-world machine learning tasks. A large number of studies have been conducted for addressing this challenge. However, most of the existing studies focus on the classification task and only a limited number of studies for symbolic regression with missing values exist . In this work, a new imputation method for symbolic regression with incomplete data is proposed. The method aims to improve both the effectiveness and efficiency of imputing mi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 48 publications
(12 citation statements)
references
References 38 publications
0
12
0
Order By: Relevance
“…Then, it repeatedly creates new individuals with the crossover, mutation and reproduction operators, until some stopping criteria are met. GP is a powerful search mechanism that has been successfully applied to various problems [30], [14]. Since computer heuristics are computer programs in nature, GP can be utilised as a Hyper Heuristics (GPHH) for evolving them automatically.…”
Section: B Related Work 1) Methods For Ucarpmentioning
confidence: 99%
See 1 more Smart Citation
“…Then, it repeatedly creates new individuals with the crossover, mutation and reproduction operators, until some stopping criteria are met. GP is a powerful search mechanism that has been successfully applied to various problems [30], [14]. Since computer heuristics are computer programs in nature, GP can be utilised as a Hyper Heuristics (GPHH) for evolving them automatically.…”
Section: B Related Work 1) Methods For Ucarpmentioning
confidence: 99%
“…GP-based transfer learning and optimisation has been applied to a range of problems including symbolic regression [14], [15] and UCARP [16], [17]. However, previous studies have found that the GP process can lose its population diversity [16], [18], [19].…”
Section: Introductionmentioning
confidence: 99%
“…While the other hand, the unsupervised algorithm learns on its own, from the unlabelled data to extract features and patterns for missing data [14]. In some cases, hybrid approaches [15][16][17][18][19], have been utilized to solve the weaknesses of the traditional supervised and unsupervised imputation methods. However, it is important to note that the only suitable solution comes down to a virtuous design and good analysis [20].…”
Section: Introductionmentioning
confidence: 99%
“…GP takes a population of candidate solutions (computer programs) and progressively evolves better solutions by applying operators analogous to natural genetic processes such as mutation and crossover on the population. Typical applications of GP includes classification [218,259,194,32] and regression [10,9,22,96].…”
Section: Genetic Programming (Gp)mentioning
confidence: 99%
“…In [10], GP is hybridized with KNN to develop an imputation method that estimates the missing values. The KNN method is used to retrieve instances similar to the incomplete data and GP uses those retrieved instances to build regression models that predict the missing values.…”
Section: Gp and Symbolic Regressionmentioning
confidence: 99%