2021
DOI: 10.1109/jsen.2020.3044049
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning Framework for Delay Sensitive Energy Harvesting Wireless Sensor Networks

Abstract: A multi-hop energy harvesting wireless sensor network (EH-WSNs) is a key enabler for future communication systems such as the internet-of-things. Optimal power management and routing selection are important for the operation and successful deployment of EH-WSNs. Characterizing the optimal policies increases significantly with the number of nodes in the network. In this paper, optimal control policy is devised based on minimum-delay transmission in a multi-hop EH-WSN using reinforcement learning (RL). The WSN c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 41 publications
0
5
0
Order By: Relevance
“…The predecessor of the PPO is trust region policy optimization (TRPO). PPO draws on the basic ideas of TRPO and importance sampling to approximate the objective function in (23) to obtain the surrogate objective function. The surrogate objective function can be expressed as:…”
Section: Brief Review Of Ppomentioning
confidence: 99%
See 1 more Smart Citation
“…The predecessor of the PPO is trust region policy optimization (TRPO). PPO draws on the basic ideas of TRPO and importance sampling to approximate the objective function in (23) to obtain the surrogate objective function. The surrogate objective function can be expressed as:…”
Section: Brief Review Of Ppomentioning
confidence: 99%
“…In order to tackle the access and continuous power control problems simultaneously, [22] proposed an RL method based on actor-critic DQN that takes into account both the sum-rate and prediction loss. In [23], RL is used to develop an optimal control policy based on minimum-delay transmission in a multi-hop energy harvesting wireless sensor network. [24] proposed a novel RL method based on value function approximation to tackle the scheduling problem online.…”
Section: Introductionmentioning
confidence: 99%
“…Let's denote the improvement degree of the chosen individual when one LS operator is employed to refine it, which can be determined using Eq. (10).…”
Section: Adaptive Hill Climbingmentioning
confidence: 99%
“…This research work proposes a novel hybrid algorithm to perform the clustering process in wireless sensor networks. The research work is composed of dual efficient algorithms namely the Memetic algorithm to perform the energy-efficient clustering process [10] while the adaptive hill-climbing algorithm is to identify the critical shortest path among the nodes in the Wireless Sensor Networks. The proposed clustering mechanism is proved to be energy efficient as it transfers the data to the neighbor node which is located at the shortest distance.…”
Section: Introductionmentioning
confidence: 99%
“…optimal resource allocation, energy-efficient communication, routing selection, etc.) in WSNs are being explored due to its prominence compared to other traditional approaches [9], [10], [11]. RL is an area of machine learning where an intelligent agent takes actions in a simulated environment and learns the optimal policy by trial-and-error process over time.…”
Section: Introductionmentioning
confidence: 99%