2023
DOI: 10.3390/computers12020022
|View full text |Cite
|
Sign up to set email alerts
|

A Centralized Routing for Lifetime and Energy Optimization in WSNs Using Genetic Algorithm and Least-Square Policy Iteration

Abstract: Q-learning has been primarily used as one of the reinforcement learning (RL) techniques to find the optimal routing path in wireless sensor networks (WSNs). However, for the centralized RL-based routing protocols with a large state space and action space, the baseline Q-learning used to implement these protocols suffers from degradation in the convergence speed, network lifetime, and network energy consumption due to the large number of learning episodes required to learn the optimal routing path. To overcome … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 46 publications
0
3
0
Order By: Relevance
“…Least-Square Policy Iteration is proposed by Obi et al [17] as an effective model-free Reinforcement Learning-based technique for optimization in WSNs. The subsequent protocol architecture is a centralized routing procedure for Lifetime Energy Optimization (LEOP) with a GA and Least-Square Policy Iteration (LSPI) for generation and energy optimization (CRPLEOGALSPI).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Least-Square Policy Iteration is proposed by Obi et al [17] as an effective model-free Reinforcement Learning-based technique for optimization in WSNs. The subsequent protocol architecture is a centralized routing procedure for Lifetime Energy Optimization (LEOP) with a GA and Least-Square Policy Iteration (LSPI) for generation and energy optimization (CRPLEOGALSPI).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Xue et al [19], introduces the cross-layer-based Harris-hawks-optimization-algorithm (CL-HHO) routing protocol for WSN and k-medoids with improved artificial bee colony (K-IABC)-based energy-efficient clustering to achieve higher quality-of-service (QoS) performance. Least-Square Policy Iteration (LSPI), an effective model-free RL-based approach, is suggested by obi et al [20], to optimise network lifespan and energy usage in WSNs. A Centralised Routing Protocol for Lifetime and Energy Optimisation with a Genetic Algorithm (GA) and LSPI (CRPLEOGALSPI) is the designed protocol that was produced, which was not affected by the learning rate, selects a routing path in a given state after taking into account all feasible routing pathways.…”
Section: Literature Surveymentioning
confidence: 99%
“…Cyber-physical test bed experiments and theoretical closed-loop stability analysis enable smart manufacturing scenarios. This paper ends with questions and research ideas.GA-LSPI-based WSN centralized routing protocol was developed for lifetime and energy optimization (6) . Sink GAs generate network graph routing tables in polynomial time.…”
Section: Introductionmentioning
confidence: 99%