2008
DOI: 10.21236/ada528584
|View full text |Cite
|
Sign up to set email alerts
|

No-Regret Learning and a Mechanism for Distributed Multiagent Planning

Abstract: We develop a novel mechanism for coordinated, distributed multiagent planning. We consider problems stated as a collection of single-agent planning problems coupled by common soft constraints on resource consumption. (Resources may be real or fictitious, the latter introduced as a tool for factoring the problem). A key idea is to recast the distributed planning problem as learning in a repeated game between the original agents and a newly introduced group of adversarial agents who influence prices for the reso… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…As far as multi-agent planning is concerned, researchers have previously used Lagrangian relaxation with some success; however, past work often either assumes infinitely divisible resources [15], [4], or tries to get an exact solution and therefore doesn't scale to really large problems [16]. Researchers have also studied restrictions to the type of planning problem considered, in some cases providing strong approximation guarantees using polynomialtime algorithms [17], [1].…”
Section: Resultsmentioning
confidence: 99%
“…As far as multi-agent planning is concerned, researchers have previously used Lagrangian relaxation with some success; however, past work often either assumes infinitely divisible resources [15], [4], or tries to get an exact solution and therefore doesn't scale to really large problems [16]. Researchers have also studied restrictions to the type of planning problem considered, in some cases providing strong approximation guarantees using polynomialtime algorithms [17], [1].…”
Section: Resultsmentioning
confidence: 99%
“…This is in contrast to another large body of works on coordination that focusses on agent interaction via objective functions (e.g. [13,14,6]). Since our model is based on BLPs we can employ our method in the context of particle-based multi-robot open-loop control [5].…”
Section: Introductionmentioning
confidence: 90%
“…No-regret bounds and algorithms have been studied and deployed in a great many online learning scenarios, including, among others, time-series prediction in ARMA models [2], multi-agent coordination [9], game-theory [14,6,21]. For a classic text book, the reader is referred to [11], whereas a recent survey can be found in [18].…”
Section: Contractive Dynamical Systems With Increasingly Permanently ...mentioning
confidence: 99%
“…If a no-external regret bound is achieved then, provided the prediction loss is convex in the parameters, (sub-) optimality guarantees can be given to bound the average prediction errors and the degree of sub-optimality of the parametric predictor whose parameter is the average of all choices of the adapted online predictors' parameters. This has given rise to algorithms that are no longer purely online learning and that decompose learning and prediction into two phases: a learning phase where the online learning method is employed to adapt the learner's parameter online for some time and a subsequent prediction phase making use of the average parameter obtained from the learning phase [11,9].…”
Section: Introductionmentioning
confidence: 99%