2022
DOI: 10.1109/twc.2022.3144618
|View full text |Cite
|
Sign up to set email alerts
|

A Reliable Reinforcement Learning for Resource Allocation in Uplink NOMA-URLLC Networks

Abstract: In this paper, we propose a deep state-actionreward-state-action (SARSA) λ learning approach for optimising the uplink resource allocation in non-orthogonal multiple access (NOMA) aided ultra-reliable low-latency communication (URLLC). To reduce the mean decoding error probability in time-varying network environments, this work designs a reliable learning algorithm for providing a long-term resource allocation, where the reward feedback is based on the instantaneous network performance. With the aid of the pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
3
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 37 publications
0
3
0
Order By: Relevance
“…On the other hand, rate fairness between users or QoS is another crucial issue that should be ensured in NOMA systems 11,12 . This is because if rate fairness or quality of service requirements are not placed on the system, all transmit power would be distributed to the single user with the best channel quality, resulting in utter unfairness among users and the conversion of NOMA to OMA 13–15 …”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, rate fairness between users or QoS is another crucial issue that should be ensured in NOMA systems 11,12 . This is because if rate fairness or quality of service requirements are not placed on the system, all transmit power would be distributed to the single user with the best channel quality, resulting in utter unfairness among users and the conversion of NOMA to OMA 13–15 …”
Section: Introductionmentioning
confidence: 99%
“…11,12 This is because if rate fairness or quality of service requirements are not placed on the system, all transmit power would be distributed to the single user with the best channel quality, resulting in utter unfairness among users and the conversion of NOMA to OMA. [13][14][15] Many strategies for the optimization of resource allocation in a wireless multiuser OFDM system have been proposed and have shown promising results. However, such strategies have not been considered yet in a NOMA-based system where additional design constraints should be taken into account.…”
mentioning
confidence: 99%
“…The study [15] introduces a deep learning methodology, SARSA λ, for optimizing uplink random access in NOMA-assisted URLLC networks. The algorithm is designed to mitigate decoding inaccuracies in dynamic communication setups and tackles issues related to user grouping, RA optimization, and instantaneous feedback mechanisms.…”
Section: Introductionmentioning
confidence: 99%