2010
DOI: 10.1016/j.biosystems.2010.04.002
|View full text |Cite
|
Sign up to set email alerts
|

Tug-of-war model for the two-bandit problem: Nonlocally-correlated parallel exploration via resource conservation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
69
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
4

Relationship

4
5

Authors

Journals

citations
Cited by 79 publications
(70 citation statements)
references
References 14 publications
1
69
0
Order By: Relevance
“…Thus, it is possible to consider that there are nonlocal communications among the nodes in our model with instantaneous information transmissions. In [22][23][24], we reported significances of the nonlocal correlations in enhancing the problem-solving performances.…”
Section: Discussionmentioning
confidence: 98%
“…Thus, it is possible to consider that there are nonlocal communications among the nodes in our model with instantaneous information transmissions. In [22][23][24], we reported significances of the nonlocal correlations in enhancing the problem-solving performances.…”
Section: Discussionmentioning
confidence: 98%
“…However, the former does not always coincide with the latter in a more general context. In our previous studies [11][12][13][14], we showed that our proposed algorithm called "tug-of-war (TOW) dynamics" is more efficient than other well-known algorithms such as the modified -greedy and softmax algorithms, and is comparable to the "upper confidence bound1-tuned (UCB1T) algorithm", which is known as the best among parameter-free algorithms [15]. Moreover, TOW dynamics effectively adapt to a changing environment wherein the reward probabilities dynamically switch.…”
Section: Introductionmentioning
confidence: 92%
“…Consequently, the TOW dynamics evolve according to a particularly simple rule: in addition to the fluctuation, if machine k is played at each time t, +1 and −ω are added to X k (t) when rewarded and non-rewarded, respectively (Figure 2a). The authors have shown that these simple dynamics gain more rewards (coins or packet transmissions in cognitive radio) than those obtained by other popular algorithms for solving the BP [11][12][13][14]. Many algorithms for the BP estimate the reward probability of each machine.…”
Section: Tow Dynamicsmentioning
confidence: 99%
“…Such a nonlocal correlation, in the sense that the state-filling induced at one dot immediately alters the dynamics of the entire system, has been shown to enhance the performance in solving the BP. 24 Note also that fluctuation is essential in order to realize an "exploration" ability; that is to say, the probability of the excitation going to a dot irradiated with control light is not perfectly zero if the state-filling does not perfectly inhibit the transition, as mentioned earlier.…”
Section: A Architecturementioning
confidence: 99%