2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS)
DOI: 10.1109/focs.2019.00017

Collaborative Learning with Limited Interaction: Tight Bounds for Distributed Exploration in Multi-armed Bandits

Abstract: Best arm identification (or, pure exploration) in multi-armed bandits is a fundamental problem in machine learning. In this paper we study the distributed version of this problem where we have multiple agents, and they want to learn the best arm collaboratively. We want to quantify the power of collaboration under limited interaction (or, communication steps), as interaction is expensive in many settings. We measure the running time of a distributed algorithm as the speedup over the best centralized algorithm …
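The abstract sketches the setting (multiple agents, a shared set of arms, and a small number of communication steps) without spelling out a protocol. As a purely illustrative aid, here is a minimal Python sketch of a round-based elimination scheme in that spirit; the function name, the halving rule, and all parameters are assumptions for illustration, not the algorithm analyzed in the paper.

```python
import random

def collaborative_best_arm(means, n_agents=4, n_rounds=3, pulls_per_round=2000, seed=0):
    """Illustrative round-based elimination sketch (not the paper's algorithm).

    Between communication steps, each agent pulls the surviving arms locally;
    agents merge their estimates only at the end of each round.
    """
    rng = random.Random(seed)
    candidates = list(range(len(means)))

    for _ in range(n_rounds):
        if len(candidates) == 1:
            break
        # Local phase: every agent samples each surviving arm independently.
        totals = {a: 0.0 for a in candidates}
        counts = {a: 0 for a in candidates}
        per_agent = max(1, pulls_per_round // (n_agents * len(candidates)))
        for _agent in range(n_agents):
            for a in candidates:
                for _ in range(per_agent):
                    totals[a] += 1.0 if rng.random() < means[a] else 0.0
                    counts[a] += 1
        # Communication step: merge estimates and keep the better half of the arms.
        est = {a: totals[a] / counts[a] for a in candidates}
        candidates.sort(key=lambda a: est[a], reverse=True)
        candidates = candidates[: max(1, len(candidates) // 2)]

    return candidates[0]

# Example: arm 3 (mean 0.7) should be identified with high probability.
print(collaborative_best_arm([0.3, 0.5, 0.45, 0.7, 0.2]))
```

The point of the sketch is that interaction is confined to n_rounds merge steps, which is the quantity the paper's speedup-versus-rounds trade-off concerns; everything else (pull budget, elimination rule) is a placeholder.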

Cited by 28 publications (61 citation statements)
References 27 publications
“…Moreover, when reducing CoPE-KB to prior CoPE with classic MAB setting (all agents are solving the same classic MAB task) [20,38], our lower and upper bounds also match the existing state-of-the-art results in [38].…”
supporting
confidence: 63%
“…In such applications, it is important to develop a more general CoPE model that allows heterogeneous tasks and complex reward structures, and quantitatively investigate how task similarities impact learning acceleration. Motivated by the above facts, we propose a novel Collaborative Pure Exploration in Kernel Bandit (CoPE-KB) problem, which generalizes traditional single-task CoPE problems [20,22,38] to the multi-task setting. It also generalizes the classic MAB model to allow general (linear or nonlinear) reward structures via the powerful kernel representation.…”
mentioning
confidence: 99%
“…As a team-based and student-centred educational practice, it promotes student motivation and enhances knowledge retention via teamwork and cooperation (Sung & Hwang, 2013). While collaborative learning has been introduced and practiced in co-located settings (Barmaki et al, 2019; Huang et al, 2019; Prinsen et al, 2007; Schneider et al, 2018; Sung & Hwang, 2013), as well as distributed settings (de Freitas & Griffiths, 2007; Li et al, 2008; Schaf et al, 2009; Tao et al, 2019), measuring and evaluating collaboration still remains a challenge. Fairness of group work distribution (Ng et al, 2019), rationality of collaborative conditions (Innes & Booher, 2016) and automatism of process analytics (Rosé et al, 2008) are some of the core issues that need to be considered during collaborative learning analytics, especially in relatively large teams (Bertsimas & Gupta, 2016).…”
Section: Introduction
mentioning
confidence: 99%