2019
DOI: 10.1007/s10489-019-01592-4
|View full text |Cite
|
Sign up to set email alerts
|

Evolutionary dataset optimisation: learning algorithm quality through evolution

Abstract: In this paper we propose a new method for learning how algorithms perform. Classically, algorithms are compared on a finite number of existing (or newly simulated) benchmark data sets based on some fixed metrics. The algorithm(s) with the smallest value of this metric are chosen to be the 'best performing'.We offer a new approach to flip this paradigm. We instead aim to gain a richer picture of the performance of an algorithm by generating artificial data through genetic evolution, the purpose of which is to c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 36 publications
(18 reference statements)
0
2
0
Order By: Relevance
“…The value of 0 implies no distinct association, and the value of −1 indicates the wrong assignment to a cluster [ 36 ]. Silhouette coefficient is employed by scholars widely to calculate the optimal number of clusters [ 37 , 38 ]. Let us assume that A and C are two different clusters and i ′ ∈ A .…”
Section: Preliminariesmentioning
confidence: 99%
“…The value of 0 implies no distinct association, and the value of −1 indicates the wrong assignment to a cluster [ 36 ]. Silhouette coefficient is employed by scholars widely to calculate the optimal number of clusters [ 37 , 38 ]. Let us assume that A and C are two different clusters and i ′ ∈ A .…”
Section: Preliminariesmentioning
confidence: 99%
“…All of the results leading up to this point were conducted using benchmark datasets and while there are certainly benefits to comparing methods in this way, it does not afford a rich understanding of how any of them perform more generally. This stage of the analysis relies on a method for generating artificial datasets introduced in [25]. In essence, this method is an evolutionary algorithm which acts on entire datasets to explore the space in which potentially all possible datasets exist.…”
Section: Artificial Datasetsmentioning
confidence: 99%