2019
DOI: 10.1109/tsp.2019.2924579
|View full text |Cite
|
Sign up to set email alerts
|

Joint Rate and Power Optimization for Multimedia Streaming in Wireless Fading Channels via Parametric Policy Gradient

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 33 publications
0
4
0
Order By: Relevance
“…On the other hand, policy-based reinforcement learning plays an important role in the optimization of wireless channel. [22] focused on the joint rate and power optimization in the fading channel by using policy gradient with deterministic policy. Innovatively, Actor-Critic method, as the combination of policy gradient and tabular Q-learning, performed well in the problem of user scheduling and resource allocation in [23].…”
Section: A Prior Workmentioning
confidence: 99%
“…On the other hand, policy-based reinforcement learning plays an important role in the optimization of wireless channel. [22] focused on the joint rate and power optimization in the fading channel by using policy gradient with deterministic policy. Innovatively, Actor-Critic method, as the combination of policy gradient and tabular Q-learning, performed well in the problem of user scheduling and resource allocation in [23].…”
Section: A Prior Workmentioning
confidence: 99%
“…However, in real world, 'fast' dynamics and 'slow' dynamics may coexist in one system, which can be modeled as a singularly perturbation system, such as circuit systems, high-speed aircraft systems, etc. [31][32][33][34]. Recently, SPCNs have attracted considerable concern of many scholars [35][36][37][38][39][40][41][42][43][44].…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, a DNN task may have to span multiple time slots to finish. In addition, the wireless channels are also time-varying due to the mobility of the WDs [29], [30]. Thus, the existing static optimization approach fails to embrace the dynamic nature of the DNN task offloading problem.…”
Section: Introductionmentioning
confidence: 99%