2020
DOI: 10.1109/tcomm.2019.2947418
|View full text |Cite
|
Sign up to set email alerts
|

Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approaches

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
28
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 69 publications
(28 citation statements)
references
References 22 publications
0
28
0
Order By: Relevance
“…Therefore, for the same state and action pair, the reward could be different in successive trials. A simple approach employed to get stable reward values is to execute successive trials and take the trials' average reward [35]. We determine the average reward values by executing consecutive trials for each state and action pair.…”
Section: Architecture Of the Dqn And Other Design Considerationsmentioning
confidence: 99%
“…Therefore, for the same state and action pair, the reward could be different in successive trials. A simple approach employed to get stable reward values is to execute successive trials and take the trials' average reward [35]. We determine the average reward values by executing consecutive trials for each state and action pair.…”
Section: Architecture Of the Dqn And Other Design Considerationsmentioning
confidence: 99%
“…Thus, we can further reduce the computational complexity of the proposed algorithm by a relatively lower number of iterations, which seems feasible to an extent for practical applications. Compared with optimization techniques, deep learning-based method has gained widespread popularity recently to deal with the resource allocation problem for NOMA systems in a non-iterative fashion, which has been demonstrated to be able to achieve good performance with reasonable training time [39,40,41]. However, it is out of the scope of this work and it will be a topic for our further study.…”
Section: Remarkmentioning
confidence: 99%
“…f,m according to (38), (39), (40), and (41) under the step size constraint of (42), (43), (44), and (45). as well as the power allocation alternatively, the iteration process is terminated when the convergence of the optimal power update is guaranteed or the maximum number of iterations is reached.…”
mentioning
confidence: 99%
“…Some researchers have applied DL to NOMA systems [ 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 ]. In [ 15 ], a DL-aided sparse code multiple access (SCMA) was proposed in which the mapping of data to the resource and the decoding of received signals is conducted with a deep neural network (DNN).…”
Section: Introductionmentioning
confidence: 99%