ICC 2019 - 2019 IEEE International Conference on Communications (ICC) 2019
DOI: 10.1109/icc.2019.8762084
|View full text |Cite
|
Sign up to set email alerts
|

Deep Multi-Agent Reinforcement Learning Based Cooperative Edge Caching in Wireless Networks

Abstract: The growing demand on high-quality and lowlatency multimedia services has led to much interest in edge caching techniques. Motivated by this, we in this paper consider edge caching at the base stations with unknown content popularity distributions. To solve the dynamic control problem of making caching decisions, we propose a deep actor-critic reinforcement learning based multi-agent framework with the aim to minimize the overall average transmission delay. To evaluate the proposed framework, we compare the le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
19
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 44 publications
(20 citation statements)
references
References 14 publications
1
19
0
Order By: Relevance
“…The real trace-driven evaluation results show that the proposed MAA2C reduces the latency around 21% and the cost around 26% compared with the state-of-art caching methods such as DRL [124], joint action learners [125]. The actor-critic framework for cooperative edge caching can also be found in [126]. However, unlike the unique model for each BS used in [123], the agent model of each BS in [126] contains a unique actor and a shared critic.…”
Section: B Content Cachingmentioning
confidence: 99%
“…The real trace-driven evaluation results show that the proposed MAA2C reduces the latency around 21% and the cost around 26% compared with the state-of-art caching methods such as DRL [124], joint action learners [125]. The actor-critic framework for cooperative edge caching can also be found in [126]. However, unlike the unique model for each BS used in [123], the agent model of each BS in [126] contains a unique actor and a shared critic.…”
Section: B Content Cachingmentioning
confidence: 99%
“…Example: Multi-agent task offloading [28] and multi-agent cooperative edge caching [29] are wireless problems which can be modeled as Dec-POMDP problems.…”
Section: ) Markov/stochastic Gamesmentioning
confidence: 99%
“…For edge caching, [29] and [104] propose MADDPG-like algorithms to solve the cooperative multi-agent edge caching problem. Both of these works model the cooperative edge caching as Dec-POMDP and differ in the definition of the state space and reward functions.…”
Section: Applications a Marl For Mec Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…Age of information has been proposed as an effective metric to quantify the freshness of information in communication networks. There have been lots of efforts on Age of information such as social networks [7], web crawling [8]- [10], queueing networks [11]- [14], caching systems [15]- [24], scheduling in networks [25]- [28], multi-hop multicast networks [29]- [31], reinforcement learning [32]- [33] and so on.…”
Section: Introductionmentioning
confidence: 99%