Scaling Evolutionary Programming With the Use of Apache Spark

Funika, Włodzimierz; Koperek, Paweł

doi:10.7494/csci.2016.17.1.69

Cited by 2 publications

(4 citation statements)

References 14 publications


“…The datasets used, generated, and analyzed during the current study are available in publicly accessible repository [20] or can be provided from the corresponding author on a reasonable request.…”

Section: Data Availability Statement (mentioning)

confidence: 99%

“…This allows the management system to respond to a potentially changing workload while addressing the issues described above. This article's contribution comprises a novel architecture of an autonomous management system which utilizes a continuous policy improvement loop, initial policy training procedure, implementation of the described concepts available as an open source project [20], experiments demonstrating the functioning of such a management system (comparison with a static policy, reacting to changes in environment) and analysis of their results.…”

Section: Introduction (mentioning)

confidence: 99%


Continuous self‐adaptation of control policies in automatic cloud management

Funika; Koperek; Kitowski

2022, Concurrency and Computation


Deep reinforcement learning has recently been a very active field of research. The policies generated with this class of training algorithms are flexible and thus have many practical applications. In this article we present the results of our attempt to use recent advancements in reinforcement learning to automate the management of resources in a compute cloud environment. We describe a new approach to self-adaptation of autonomous management, which uses a digital clone of the managed infrastructure to continuously update the control policy. We present the architecture of our system and discuss the results of an evaluation which includes autonomous management of a sample application deployed to the Amazon Web Services cloud. We also provide the details of the training of the management policy using the Proximal Policy Optimization algorithm. Finally, we discuss the feasibility of extending the presented approach to further scenarios.


“…The reducer aggregates the previously created vectors. Funika et al [7] implement an 'Evaluation Service' that can be solicited through a REST API. It is not an implementation of a specific EA algorithm but rather an outsourcing of the evaluation.…”

Section: Previous Work (mentioning)

confidence: 99%

“…The evaluation represents more than 80% of the total time cost in EA [7,9]. We suggest to focus on evaluation which can be easily distributed on Spark cluster even with limited resources seeing that we do not need independent populations.…”

Section: Implementation Model (mentioning)

confidence: 99%

Genetic Programming over Spark for Higgs Boson Classification

Hmida; Hamida; Borgi; et al.

2019, Business Information Systems


With the growing number of available databases having a very large number of records, existing knowledge discovery tools need to be adapted to this shift and new tools need to be created. Genetic Programming (GP) has proven to be an efficient algorithm, in particular for classification problems. However, GP is hampered by its computing cost, which is more acute with large datasets. This paper presents how an existing GP implementation (DEAP) can be adapted by distributing evaluations on a Spark cluster. Then, an additional sampling step is applied to fit tiny clusters. Experiments are carried out on Higgs Boson classification with different settings. They show the benefits of using Spark as a parallelization technology for GP.

Keywords: Genetic Programming • Machine learning • Spark • Large dataset • Higgs Boson classification
