2016
DOI: 10.7494/csci.2016.17.1.69
|View full text |Cite
|
Sign up to set email alerts
|

Scaling Evolutionary Programming With the Use of Apache Spark

Abstract: Organizations across the globe gather more and more data, encouraged by easyto-use and cheap cloud storage services. Large datasets require new approaches to analysis and processing, which include methods based on machine learning. In particular, symbolic regression can provide many useful insights. Unfortunately, due to high resource requirements, use of this method for large-scale dataset analysis might be unfeasible. In this paper, we analyze a bottleneck in the open-source implementation of this method we … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…The datasets used, generated, and analyzed during the current study are available in publicly accessible repository 20 or can be provided from the corresponding author on a reasonable request.…”
Section: Data Availability Statementmentioning
confidence: 99%
See 1 more Smart Citation
“…The datasets used, generated, and analyzed during the current study are available in publicly accessible repository 20 or can be provided from the corresponding author on a reasonable request.…”
Section: Data Availability Statementmentioning
confidence: 99%
“…This allows the management system to respond to a potentially changing workload while addressing the issues described above. This article's contribution comprises a novel architecture of an autonomous management system which utilizes a continuous policy improvement loop, initial policy training procedure, implementation of the described concepts available as an open source project, 20 experiments demonstrating the functioning of such a management system (comparison with a static policy, reacting to changes in environment) and analysis of their results.…”
Section: Introductionmentioning
confidence: 99%
“…The reducer aggregates the previously created vectors. Funika et al [7] implement an 'Evaluation Service' that can be solicited through a REST API. It is not an implementation of a specific EA algorithm but rather an outsourcing of the evaluation.…”
Section: Previous Workmentioning
confidence: 99%
“…The evaluation represents more than 80% of the total time cost in EA [7,9]. We suggest to focus on evaluation which can be easily distributed on Spark cluster even with limited resources seeing that we do not need independent populations.…”
Section: Implementation Modelmentioning
confidence: 99%