2021
DOI: 10.1109/tac.2021.3049345
|View full text |Cite
|
Sign up to set email alerts
|

Finite-Sample Analysis for Decentralized Batch Multiagent Reinforcement Learning With Networked Agents

Abstract: This paper proposes a multiagent based bi-level operation framework for the low-carbon demand management in distribution networks considering the carbon emission allowance on the demand side. In the upper level, the aggregate load agents optimize the control signals for various types of loads to maximize the profits; in the lower level, the distribution network operator makes optimal dispatching decisions to minimize the operational costs and calculates the distribution locational marginal price and carbon int… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
56
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 46 publications
(58 citation statements)
references
References 48 publications
2
56
0
Order By: Relevance
“…It is stated that immigrant students experience negativities such as loneliness or tendency to violence due to their inability to express themselves in social environments. Regarding school attendance problem (Zhang and Basar, 2018) stated that the lack of legislation for refugee students prevents the measures that can be taken against these students. It has been determined that the problem arising from the lack of legislation is the problem of absenteeism.…”
Section: Discussionmentioning
confidence: 99%
“…It is stated that immigrant students experience negativities such as loneliness or tendency to violence due to their inability to express themselves in social environments. Regarding school attendance problem (Zhang and Basar, 2018) stated that the lack of legislation for refugee students prevents the measures that can be taken against these students. It has been determined that the problem arising from the lack of legislation is the problem of absenteeism.…”
Section: Discussionmentioning
confidence: 99%
“…of any non-empty participant is called a coalition. The Shapley value can be used to calculate the profit distributed by participant i, as shown in formulas: (27) and (28). !…”
Section: Cooperative Game Model 1) Profit Calculation Modelmentioning
confidence: 99%
“…In order to solve the optimal quotation problem of thermal power companies under the multi-agent incomplete information game, the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm based on the multiagent reinforcement learning method was proposed [28][29][30][31] . The neural network parameters are updated to simulate the bounded rational process of the game to ensure that the game process is close to reality.…”
Section: Introductionmentioning
confidence: 99%
“…COMA (Foerster et al 2018) constructs a centralized critic and computes an agent-specific advantage function to derive a decentralized actor. FDMARL (Zhang et al 2018) has proposed a distributed learning approach for each agent to learn a global critic using its local reward and the transferred critic parameters from the networked neighboring agents. Because these models directly use the state or observation in constructing critic or actor networks, it is difficult to apply such models to a large-scale environment or transfer them to new environments.…”
Section: Learning-for-consensusmentioning
confidence: 99%