2019
DOI: 10.1007/s10732-019-09408-x
Dynamic heuristic acceleration of linearly approximated SARSA(λ): using ant colony optimization to learn heuristics dynamically

Abstract: Heuristically accelerated reinforcement learning (HARL) is a new family of algorithms that combines the advantages of reinforcement learning (RL) with those of heuristic algorithms. To achieve this, the action selection strategy of the standard RL algorithm is modified to take into account a heuristic running in parallel with the RL process. This paper presents two approximated HARL algorithms that make use of pheromone trails to improve the behaviour of linearly approximated SARSA(λ) by dynamically l…
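The abstract describes modifying the RL action selection strategy to account for a heuristic running alongside learning. A common HARL form adds a weighted heuristic term to the value estimate before the greedy choice. The sketch below is a hypothetical illustration of that idea, assuming a linear approximation q(s, a) = w·φ(s, a) and a pheromone-derived heuristic H(s, a); all names, features, and weights are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

N_FEATURES, N_ACTIONS = 8, 4
w = rng.normal(size=N_FEATURES)            # weights of the linear approximator

def phi(state, action):
    """Toy one-hot feature vector for a (state, action) pair."""
    v = np.zeros(N_FEATURES)
    v[(state + action) % N_FEATURES] = 1.0
    return v

def q(state, action):
    """Linearly approximated action value."""
    return w @ phi(state, action)

def heuristic(state, action, pheromone):
    """Heuristic term: here simply the pheromone level on (state, action)."""
    return pheromone[state, action]

def select_action(state, pheromone, xi=1.0, epsilon=0.1):
    """Epsilon-greedy over heuristically accelerated values Q + xi * H."""
    if rng.random() < epsilon:
        return int(rng.integers(N_ACTIONS))
    scores = [q(state, a) + xi * heuristic(state, a, pheromone)
              for a in range(N_ACTIONS)]
    return int(np.argmax(scores))

pheromone = np.zeros((16, N_ACTIONS))
pheromone[3, 2] = 10.0          # a strong trail biases action 2 in state 3
a = select_action(3, pheromone, xi=1.0, epsilon=0.0)
```

With the heuristic weight ξ set to zero this reduces to plain ε-greedy over the approximated values, so the heuristic only biases, never replaces, the learned policy.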

Cited by 2 publications (2 citation statements); references 40 publications (55 reference statements).
“…When it receives a new tuple (s, r, d), it returns an action according to a policy that depends on the implemented algorithm (ε-greedy, for example). The model has a state variable Q, an S × A matrix used to implement the learning algorithm (Q-learning or SARSA [4], for example). Once convergence is reached (judged from the values of Q), the model becomes passive and no longer responds to the environment.…”
Section: Motivation and Contribution
confidence: 99%
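The setting described in this excerpt (an S × A Q table updated under an ε-greedy policy) can be sketched with standard tabular SARSA. The tiny chain environment, parameter values, and tie-breaking rule below are illustrative assumptions, not the citing paper's model.

```python
import numpy as np

rng = np.random.default_rng(1)
S, A = 5, 2                     # 5-state chain; actions: 0 = left, 1 = right
Q = np.zeros((S, A))            # the S x A state-action value matrix

def epsilon_greedy(s, epsilon=0.1):
    # Random tie-breaking so the untrained agent is not stuck going left.
    if rng.random() < epsilon or np.all(Q[s] == Q[s][0]):
        return int(rng.integers(A))
    return int(np.argmax(Q[s]))

def step(s, a):
    """Chain MDP: reward 1 for reaching the rightmost state, else 0."""
    s2 = min(s + 1, S - 1) if a == 1 else max(s - 1, 0)
    r = 1.0 if s2 == S - 1 else 0.0
    return s2, r, s2 == S - 1

alpha, gamma = 0.5, 0.9
for _ in range(200):            # episodes
    s, a = 0, epsilon_greedy(0)
    done = False
    while not done:
        s2, r, done = step(s, a)
        a2 = epsilon_greedy(s2)
        # SARSA: on-policy TD update using the action actually taken next
        Q[s, a] += alpha * (r + gamma * Q[s2, a2] * (not done) - Q[s, a])
        s, a = s2, a2
```

After training, the greedy policy read off the Q table moves right along the chain, which is the "convergence" the excerpt refers to before the model turns passive.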
“…The mechanism of the ACS algorithm is inspired by the natural behaviour of biological ant colonies [2], [3]. The ACS algorithm builds on the notion of reinforcement learning [4]. It uses ants as agents that manipulate the environment via pheromone trails to find the shortest path.…”
Section: Introduction
confidence: 99%
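The pheromone-trail mechanism mentioned in this excerpt can be illustrated with a minimal ant-colony-style path search: ants choose edges probabilistically by pheromone and inverse distance, then deposit pheromone in proportion to path quality. The graph, parameter names, and update rule below are simplified assumptions, not the cited ACS algorithm's exact formulation.

```python
import numpy as np

rng = np.random.default_rng(2)

# A tiny directed graph as a distance matrix; np.inf marks missing edges.
INF = np.inf
dist = np.array([
    [INF, 1.0, 4.0],
    [INF, INF, 1.0],
    [INF, INF, INF],
])
n = dist.shape[0]
tau = np.where(np.isfinite(dist), 1.0, 0.0)   # pheromone on existing edges

def choose_next(node, alpha=1.0, beta=2.0):
    """Probabilistic edge choice weighted by pheromone and inverse distance."""
    weights = np.zeros(n)
    for j in range(n):
        if np.isfinite(dist[node, j]):
            weights[j] = tau[node, j] ** alpha * (1.0 / dist[node, j]) ** beta
    probs = weights / weights.sum()
    return int(rng.choice(n, p=probs))

def walk(start=0, goal=2):
    path, node = [start], start
    while node != goal:
        node = choose_next(node)
        path.append(node)
    return path

def deposit(path, q=1.0, rho=0.1):
    """Evaporate pheromone everywhere, then reinforce the found path."""
    global tau
    length = sum(dist[a, b] for a, b in zip(path, path[1:]))
    tau *= (1.0 - rho)
    for a, b in zip(path, path[1:]):
        tau[a, b] += q / length

for _ in range(50):
    deposit(walk())
```

Because the route 0 → 1 → 2 (length 2) earns larger deposits than the direct edge 0 → 2 (length 4), pheromone concentrates on the shorter path, which is the positive-feedback mechanism the excerpt describes.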