2019
DOI: 10.48550/arxiv.1902.06740
Preprint

Leveraging Communication Topologies Between Learning Agents in Deep Reinforcement Learning

Abstract: A common technique to improve speed and robustness of learning in deep reinforcement learning (DRL) and many other machine learning algorithms is to run multiple learning agents in parallel. A neglected component in the development of these algorithms has been how best to arrange the learning agents involved to better facilitate distributed search. Here we draw upon results from the networked optimization and collective intelligence literatures suggesting that arranging learning agents in less than fully connected…
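To make the abstract's idea concrete: the sketch below is a hypothetical illustration (the ring topology, agent count, and neighbor_average helper are assumptions of this sketch, not the paper's setup) of parallel learners that exchange parameters only with graph neighbors instead of mixing through a single central learner.

```python
import numpy as np

rng = np.random.default_rng(0)
n_agents, dim = 8, 4                        # 8 parallel learners, 4 policy parameters each
params = rng.normal(size=(n_agents, dim))   # each row: one agent's current parameters

# Sparse communication topology: a ring, so each agent talks to only 2 neighbors.
adjacency = np.zeros((n_agents, n_agents), dtype=bool)
for i in range(n_agents):
    adjacency[i, (i - 1) % n_agents] = True
    adjacency[i, (i + 1) % n_agents] = True

def neighbor_average(params, adjacency):
    """One communication round: each agent averages its parameters with its
    graph neighbors (itself included), instead of pulling a global average."""
    mixed = np.empty_like(params)
    for i in range(len(params)):
        group = np.append(np.flatnonzero(adjacency[i]), i)
        mixed[i] = params[group].mean(axis=0)
    return mixed

# A fully connected topology would collapse everyone to the global mean in one
# round; the sparse topology keeps more diversity in parameter space.
params = neighbor_average(params, adjacency)
```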

Cited by 2 publications (6 citation statements). References 22 publications.

“…We now establish the policy improvement theorem of stochastic TAPE, and prove a theorem for the cooperation improvement from the perspective of exploring the parameter space, which is a common motivation in RL research (Schulman, Chen, and Abbeel 2017; Haarnoja et al. 2018; Zhang et al. 2021; Adjodah et al. 2019). We assume the policy to have tabular expressions.…”
Section: Theoretical Results (mentioning)
confidence: 99%
“…As the global Q value is determined by the centralized critic for all agents, sub-optimal actions of one agent will easily influence all others. Topology in Reinforcement Learning. Adjodah et al. (Adjodah et al. 2019) discuss the communication topology issue in parallel-running RL algorithms such as A3C (Mnih et al. 2016). Results show that the centralized learner implicitly yields a fully-connected communication topology among parallel workers, which will harm their performance.…”
Section: Related Work (mentioning)
confidence: 99%
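One way to read the quoted point is through the mixing matrices used in decentralized optimization: a central learner that every worker pushes to and pulls from mixes all workers in a single step, exactly like a fully connected graph, whereas a sparse graph mixes only neighbors. The small numeric sketch below (worker count and matrices chosen purely for illustration) shows the difference after one communication round.

```python
import numpy as np

n = 4  # number of parallel workers

# Fully connected / centralized: one round averages everyone together.
W_full = np.full((n, n), 1.0 / n)

# Sparse ring topology: each worker averages only with its two neighbors and itself.
W_ring = np.zeros((n, n))
for i in range(n):
    for j in (i, (i - 1) % n, (i + 1) % n):
        W_ring[i, j] = 1.0 / 3.0

params = np.array([0.0, 1.0, 2.0, 3.0])  # one scalar parameter per worker
print(W_full @ params)  # [1.5 1.5 1.5 1.5] -> all workers identical after one round
print(W_ring @ params)  # [1.33 1.  2.  1.67] -> workers still differ, search stays diverse
```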
“…
• Fast aggregation via over-the-air computation [21], [84], [85], [86]
• Aggregation frequency control with limited bandwidth and computation resources [87], [88], [89]
• Data reshuffling via index coding and pliable index coding for improving training performance [90], [91], [92]
• Straggler mitigation via coded computing [93], [94], [95], [96], [97], [98], [99], [100], [101]
• Training in decentralized system mode [102], [103], [104], [105], [106], [107], [108], [109], [110], [111], [112]
…”
Section: Data Partition Based Edge Training Systems (mentioning)
confidence: 99%
“…There have been several works demonstrating that some carefully designed topologies of networks achieve better performance than the fully connected network. It has been empirically observed in [107] that using an alternative network topology between devices can lead to improved learning performance in several deep reinforcement learning tasks compared with the standard fully-connected communication topology. Specifically, it was observed in [107] that the Erdos-Renyi graph topology with 1000 devices can compete with the standard fully-connected topology with 3000 devices, which shows that the machine learning performance can be more efficient if the topology is carefully designed.…”
Section: AWGN Receive Beamformer (mentioning)
confidence: 99%
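The Erdos-Renyi comparison quoted above is easy to sketch: the snippet below (device count kept at 1000, but the edge probability is an assumption chosen for illustration, not the paper's exact setting) builds such a random communication graph with networkx and contrasts its link count with a fully connected topology of the same size.

```python
import networkx as nx

n_devices, p = 1000, 0.01  # p is an illustrative choice, above the ~ln(n)/n connectivity threshold
er_graph = nx.erdos_renyi_graph(n_devices, p, seed=0)
full_graph = nx.complete_graph(n_devices)

print(nx.is_connected(er_graph))    # check the sparse topology is still one connected component
print(er_graph.number_of_edges())   # roughly p * n * (n - 1) / 2, i.e. ~5,000 links
print(full_graph.number_of_edges()) # n * (n - 1) / 2 = 499,500 links for full connectivity
```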